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MOLECULES FOR DIAGNOSTICS AND THERAPEUTICS 



TECHNICAL FIELD 

The present invention relates to human molecules and to the use of these sequences in the 
5 diagnosis, study, prevention, and treatment of diseases associated with, as well as effects of exogenous 
compounds on, the expression of human molecules. 



BACKGROUND OF THE INVENTION 

The human genome is comprised of thousands of genes, many encoding gene products that 
10 function in the maintenance and growth of the various cells and tissues in the body. Aberrant 

expression or mutations in these genes and their products is the cause of, or is associated with, a variety 
of human diseases such as cancer and other cell proliferative disorders, autoimmune/inflammatory 
disorders, infections, developmental disorders, endocrine disorders, metabolic disorders; neurological 
disorders, gastrointestinal disorders, transport disorders, and connective tissue disorders. The 
1 5 identification of these genes and their products is the basis of an ever-expanding effort to find markers 
for early detection of diseases, and targets for their prevention and treatment Therefore, these genes 
and their products are useful as diagnostics and therapeutics. These genes may encode, for example, 
enzyme molecules, molecules associated with growth and development, biochemical pathway molecules, 
extracellular information transmission molecules, receptor molecules, intracellular signaling molecules, 

2 o membrane transport molecules, protein modification and maintenance molecules, nucleic acid synthesis 

and modification molecules, adhesion molecules, antigen recognition molecules, secreted and 
extracellular matrix molecules, cytoskeletal molecules, ribosomal molecules, electron transfer 
associated molecules, transcription factor molecules, chromatin molecules, cell membrane molecules, 
and organelle associated molecules. 
25 For example, cancer represents a type of cell proliferative disorder that affects nearly every . 

tissue in the body. A wide variety of molecules, either aberrantly expressed or mutated, can be the 
cause of, or involved with, various cancers because tissue growth involves complex and ordered 
patterns of cell proliferation, cell differentiation, and apoptosis. Cell proliferation must be regulated to 
maintain both the number of cells and their spatial organization. This regulation depends upon the 

3 o appropriate expression of proteins which control cell cycle progression in response to extracellular 

signals such as growth factors and other mitogens, and intracellular cues such as DNA damage or . 
nutrient starvatioa Molecules which directly or indirectly modulate cell cycle progression fall into 
several categories, including growth factors and their receptors, second messenger and signal 
transduction proteins, oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. 



1 



WO 01/62927 



PCIYUS01/06059 



Aberrant expression or mutations in any of these gene products can result in cell proliferative disorders 
such as cancer. Oncogenes are genes generally derived from normal genes that, through abnormal 
expression or mutation, can effect the transformation of a normal cell to a malignant one (oncogenesis). 
Oncoproteins, encoded by oncogenes, can affect cell proliferation in a variety of ways and include 
5 growth factors, growth factor receptors, intracellular signal transducers, nuclear transcription factors, 
and cell-cycle control proteins. In contrast, tumor-suppressor genes are involved in inhibiting cell 
proliferation. Mutations which cause reduced function or loss of function in tumor-suppressor genes 
result in aberrant cell proliferation and cancer. Although many different genes and their products have 
been found to be associated with cell proliferative disorders such as cancer, many more may exist that 

10 are yet to be discovered 

DNA-based arrays can provide a simple way to explore the expression of a single polymorphic 
gene or a large number of genes. When the expression of a single gene is explored, DNA-based arrays 
are employed to detect the expression of specific gene variants. For example, a p53 tumor suppressor 
gene array is used to determine whether individuals are carrying mutations that predispose them to 

15 cancer. A cytochrome p450 gene array is useful to determine whether individuals have one of a number 
of specific mutations that could result in increased drug metabolism, drug resistance or drug toxicity. 

DNA-based array technology is especially relevant for the rapid screening of expression of a 
large number of genes. There is a growing awareness that gene expression is affected in a global 

fashion. A genetic predisposition, disease or therapeutic treatment may affect, directly or indirectly, the ) 
2 o expression of a large number of genes. In some cases the interactions may be expected, such as when 
the genes are part of the same signaling pathway. In other cases, such as when the genes participate in 
separate signaling pathways, the interactions may be totally unexpected. Therefore, DNA-based arrays 
: can be used to investigate how genetic predisposition, disease, or therapeutic treatment affects the • 
expression of a large number of genes. 

25 ' 

Enzyme Molecules 

The cellular processes of biogenesis and biodegradation involve anumber of key enzyme 
classes including oxidoreductases, transferases, hydrolases, lyases, isomerases, andligases. These* 
enzyme classes are each comprised of numerous substrate-specific enzymes having precise and well 
30 regulated functions. These enzymes function by facilitating metabolic processes such as glycolysis, 
the tricarboxylic cycle, and fatty acid metabolism; synthesis or degradation of amino acids, steroids, 
phospholipids, alcohols, etc.; regulation of cell signalling, proliferation, inflamation, apoptosis, etc., 
and through catalyzing critical steps in DNA replication and repair, and the process of translation. 
Oxidoreductases 
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Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) activity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleotide 
(FAD), and the nicotinamide adenine dinucleotides NAD and NADP (Newsholme, E.A. and A.R. 
5 Leech (1983) Biochemistry for the Medical Sciences. John Wiley and Sons, Chichester, U.K., pp. 
779-793). Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) 
with concurrent oxidation of the cofactor. Hie reverse dehydrogenase reaction catalyzes the reduction 
.of a cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad 
superf amily of proteins that catalyze numerous reactions in all cells of organisms ranging from 

10 bacteria to.plants to humans. These reactions include metabolism of sugar, certain detoxification 
reactions in the liver, and the synthesis or degradation of fatty acids, amino acids, glucocorticoids, * 
. estrogens^ androgens, and prostaglandins. Different family members are named according to the 
direction in which their reactions are typically catalyzed; thus they may be .referred to as . 
oxidoreductases, oxidases, reductases, or dehydrogenases. In addition, family members often have 

1 5 distinct cellular localizations, including the cytosol, the plasma membrane, mitochondrial inner or 
outer membrane, and peroxisomes. 

Short-chain alcohol dehydrogenases (SCADs) are a family of dehydrogenases that only share 
15% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and 
the substrate binding domain. In addition to the well-known role in detoxification of ethanol, SCADs 

20 are also involved in synthesis and degradation of fatty acids, steroids, and some prostaglandins, and 
are therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD 
deficiency, and certain genetic disorders. For example, retinol dehydrogenase is a SCAD-family 
member (Simon, A. et al. (1995) J. Biol. Chem. 270:1107-1112) that converts retinol terminal, the • 
precursor of.retinoic acid. Retinoic acid, a regulator of differentiation and apoptosis, has been shown 

25 to down-regulate genes involved in cell proliferation and inflammation (Chai, X. et al: (1995) J. Biol. 
Chem. 270:3900-3904). In addition, retinol dehydrogenase has been linked to hereditary 6ye diseases 
such as autosomal recessive childhood-onset severe retinal dystrophy (Simon, A. et al: (1996) 
Genomics 36:424-430). 

Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction 

30 of the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. 
(1991) Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ann. N.Y. Acad. Sci. 664:415-424; 
Brown, J.K. and H. Imam (1991) J. Inherit Metab. Dis. 14:436-458). Many pathways of 
neurotransmitter metabolism require oxidoreductase activity, coupled to reduction or oxidation of a 
cofactor, such as NAD7NADH (Newsholme, E.A. and A.R. Leech (1983) Biochemistry for the 



3 



WO 01/62927 



PCT7US01/06059 



Medical Sciences. John Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of 
catecholamines (epinephrine or norepinephrine) requires alcohol dehydrogenase (in the hrain) or 
aldehyde dehydrogenase (in peripheral tissue). NAD + -dependent aldehyde dehydrogenase oxidizes 5- 
hydroxyindole-3-acetate (the product of 5-hydroxytryptamine (serotonin) metabolism) in the brain, ■ 
5 blood platelets, liver and pulmonary endothelium (Newsholme, supra, p. 786). Other 

neurotransmitter degradation pathways that utilize NADVNADH-dependent oxidoreductase activity 
include those of L-DOPA (precursor of dopamine, a neuronal excitatory compound), glycine (an 
, inhibitory neurotransmitter in the brain and spinal, cord), histamine (liberated from mast cells during 
; the inflammatory response), and taurine (an inhibitory neurotransmitter of the brain stem, spinal cord 
xo , ^nd retina) (Newsholme, supra , pp. 790, 792). Epigenetic or genetic defects in neurotransmitter 

metabolic pathways .can result in a spectrum of disease states in different tissues including Parkinson 
. . disease and inherited myoclonus (McGance. K.L. and S.E. Huether (1994) Pathophysiology. Mosbv- 
Year Book, Inc., St Louis MO, pp. 402-404; Gundlach, A.L. (1990) FASEB J. 4:2761-2766). 

Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 
15 . ; one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, 
pyrimidines, and the amino acid methionine. Tetrahydrofolate is generated by the activity of a 
holoenzyme complex called tetrahydrofolate synthase, which includes three enzyme activities: 
tetrahydrofolate dehydrogenase, tetrahydrofolate cyciohydrolase, and tetrahydrofolate synthetase. 
Thus, tetrahydrofolate dehydrogenase plays an important role in generating building blocks for 
2 o nucleic and amino acids, crucial to proliferating cells. 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 
catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of 
NAD to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3HACD 
. and enoyl-CoA hydratase form an enzyme complex called bifunctional enzyme, defects in which are • 
2 5 associated with peroxisomal bifunctional enzyme deficiency. This intenuption in fatty acid ' 
. metabolism produces accumulation of very-long chain fatty acids, disrupting development of the 
brain, bone, and adrenal glands. Infants born with this deficiency typically die within 6 months 
(Watkins, P. et al. (1989) J. Clin. Invest 83:771-777; Online Mendelian Inheritance in Man (OMIM)* 
#261515). The neurodegeneration that is characteristic of Alzheimer's disease involves development 
30 of extracellular plaques in certain brain regions. A major protein component of these plaques is the 
peptide amyloid-P (AP), which is one of several cleavage products of amyloid precursor protein 
(APP). 3HACD has been shown to bind the AP peptide, and is overexpressed in neurons affected in 
Alzheimer's disease. In addition, an antibody against 3HACD can block the toxic effects of AP in a 
cell culture model of Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, 
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#602057). 

Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 
common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes 
act upon cholesterol, including a number of dehydrogenases. Steroid dehydrogenases, such as the 
5 hydroxysteroid dehydrogenases, are involved in hypertension, fertility, and cancer (Duax, W.L. and 
D. Ghosh (1997) Steroids 62:95-100). One such dehydrogenase is 3-oxo-5-oc-steroid dehydrogenase 
(OASD), a microsomal membrane protein highly expressed in prostate and other androgen-responsive 
tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, which is the most 
potent androgen. Dihydrotestosterone is essential for the formation of the male phenotype during 

10 embryogenesis, as well as for proper androgen-mediated growth of tissues such as the prostate and 
male genitalia. A defect in OASD that prevents the conversion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudohermaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S. et al. (1991) Nature 354:159-161; Labrie, F. et al. 
(1992) Endocrinology 131:1571-1573; OMM #264600). Thus, OASD plays a central role in sexual 

15 differentiation and androgen physiology. 

17P-hydroxysteroid dehydrogenase (17PHSD6) plays an important role in the regulation of 
the male reproductive hormone, dihydrotestosterone (DHTT). 17PHSD6 acts to reduce levels of 
DHTT by oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuronidated 
and removed from tissues. 17PHSD6 is active with both androgen and estrogen substrates when 

2 o expressed in embryonic kidney 293 cells. At least five other isozymes of 17 PHSD have been 
identified that catalyze oxidation and/or reduction reactions in various tissues with preferences for 
different steroid substrates (Biswas, M.G. and D.W. Russell (1997) J. Biol. Chem. 272:15959- 
15966). For example, 17PHSD1 preferentially reduces estradiol and is abundant in the ovary and 
placenta. 17PHSD2 catalyzes oxidation of androgens and is present in the endometrium and placenta. 

2 5 17PHSD3 is exclusively a reductive enzyme in the testis (Geissler, W.M. et al. (i994) Nat. Genet 

7:34-39). An excess of androgens such as DHTT can contribute to certain disease states such as 
benign prostatic hyperplasia and prostate cancer. 

Oxidoreductases are components of the fatty acid metabolism pathways in mitochondria and 
peroxisomes. The main beta-oxidation pathway degrades both saturated and unsaturated fatty acids, 

3 o while the auxiliary pathway performs additional steps required for the degradation of unsaturated fatty 

acids. The auxiliary beta-oxidation enzyme 2,4-dienoyl-CoA reductase catalyzes the removal of 
even-numbered double bonds from unsaturated fatty acids prior to their entry into the main beta- 
oxidation pathway. The enzyme may also remove odd-numbered double bonds from unsaturated 
fatty acids (Koivuranta, K.T. et al. (1994) Biochem. J. 304:787-792; Smeland, T.E. et al. (1992) Proc. 
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Natl. Acad. Sci. USA 89:6673-6677). 2,4-dienoyl-CoA reductase is located in both mitochondria and 
peroxisomes. Inherited deficiencies in mitochondrial and peroxisomal beta-oxidation enzymes are 
associated with severe diseases, some of which manifest themselves soon after birth and lead to death 
within a few years. Defects in beta-oxidation are associated with Reye's syndrome, Zellweger 
5 syndrome, neonatal adrenoleukodystrophy, infantile Refcum's disease, acyl-CoA oxidase deficiency, 
and bifunctional protein deficiency (Suzuki, Y. et al. (1994) Am. J. Hum. Genet 54:36-43; Hoefler, 
supra ; Cotran, RS. et al. (1994) Robbins Pathologic Basis of Disease . W.B. Saunders Co., 
Philadelphia PA, p.866). Peroxisomal beta-oxidation is impaired in cancerous tissue. Although 
neoplastic human breast epithelial cells have the same number of peroxisomes as do normal cells, 

1 o fatty acyl-CoA oxidase activity is lower than in control tissue (el Bouhtoury , F. et aL (1992) J. Pathol. 

166:27-35). Human colon carcinomas have fewer peroxisomes than normal colon tissue and have 
lower fatty-acyl-CoA oxidase and bifunctional enzyme (including enoyl-CoA hydratase) activities 
than normal tissue (Cable, S. et al. (1992) Virchows Arch. B Cell Pathol. Incl. Mol. Pathol. 62:221- 
226). Another important oxidoreductase is isocitrate dehydrogenase, which catalyzes the conversion 
15 of isocitrate to a-ketoglutarate, a substrate of the citric acid cycle. Isocitrate dehydrogenase can be 
either NAD or NADP dependent, and is found in the cytosol, mitochondria, and peroxisomes. 
Activity of isocitrate dehydrogenase is regulated developmental^, and by hormones, 
neurotransmitters, and growth factors. 

Hydroxypyruvate reductase (HPR), a peroxisomal 2-hydroxyacid dehydrogenase in the 

2 o glycolate pathway, catalyzes the conversion of hydroxypyruvate to glycerate with the oxidation of 

both NADH and NADPH. The reverse dehydrogenase reaction reduces NAD* and NADP\ HPR 
recycles nucleotides and bases back into pathways leading to the synthesis of ATP and GTP. ATP 
and GTP are used to produce DNA and RNA and to control various aspects of signal transduction and 
energy metabolism. Inhibitors of purine nucleotide biosynthesis have long been employed as 
. 2 5 antiproliferative agents to treat cancer and viral diseases. HPR also regulates biochemical synthesis 
of serine and cellular serine levels available for protein synthesis. 

The mitochondrial electron transport (or respiratory) chain is a series of oxidoreductase-type 
enzyme complexes in the mitochondrial membrane that is responsible for the transport of electrons 
from NADH through a series of redox centos within these complexes to oxygen, and the coupling of 
30 this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP then provides the primary 
source of energy for driving a cell's many energy-requiring reactions. The key complexes in the 
respiratory chain are NADHnibiquinone oxidoreductase (complex I), succinate:ubiquinone 
oxidoreductase (complex II), cytochrome Cj-b oxidoreductase (complex HI), cytochrome c oxidase 
(complex IV), and ATP synthase (complex V) (Alberts, B. et al. (1994) Molecular Biology of the 
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Cell. Garland Publishing, Inc., New York NY, pp. 677-678). All of these complexes are located on 
the inner matrix side of the mitochondrial membrane except complex n, which is on the cytosolic 
side. Complex n transports electrons generated in the citric acid cycle to the respiratory chain. Hie 
electrons generated by oxidation of succinate to fumarate in the citric acid cycle are transferred 
5 through electron carriers in complex n to membrane bound ubiquinone (Q). Transcriptional 
regulation of these nuclear-encoded genes appears to be the predominant means for controlling the 
biogenesis of respiratory enzymes. Defects and altered expression of enzymes in the respiratory chain 
are associated with a variety of disease conditions. 

Other dehydrogenase activities using NAD as a cofactor are also important in mitochondrial 

10 function. 3-hydroxyisobutyrate dehydrogenase (3HBD), important in valine catabolism, catalyzes the 
NAD-dependent oxidation of 3-hydroxyisobutyrate to methylmalonate semialdehyde within 
mitochondria. Elevated levels of 3-hydroxyisobutyrate have been reported in a number of disease 
states, including ketoacidosis, methylmalonic acidemia, and other disorders associated with 
deficiencies in methylmalonate semialdehyde dehydrogenase (Rougraff, P.M. et al. (1989) J. Biol. 

15 Chem. 264:5899-5903). 

Another mitochondrial dehydrogenase important in amino acid metabolism is the enzyme 
isovaleryl-CoA-dehydrogenase (TVD). IVD is involved in leucine metabolism and catalyzes the 
oxidation of isovaleryl-CoA to 3-methylcrotonyl-CoA Human IVD is a tetrameric flavoprotein that 
is encoded in the nucleus and synthesized in the cytosol as a 45 kDa precursor with a mitochondrial 

2 o import signal sequence. A genetic deficiency, caused by a mutation in the gene encoding IVD, results 
in the condition known as isovaleric acidemia. This mutation results in inefficient mitochondrial 
import and processing of the IVD precursor (Vockley, J. et al. (1992) J. Biol. Chem. 267:2494-2501). 
Transferases 

Transferases are enzymes that catalyze the transfer of molecular groups. The reaction may 

2 5 involve an oxidation, reduction, or cleavage of covalent bonds, and is often specific to a substrate or 

to particular sites on a type of substrate. Transferases participate in reactions essential to such 
functions as synthesis and degradation of cell components, regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretion. Transferases are 
involved in key steps in disease processes involving these functions. Transferases are frequently 

3 o classified according to the type of group transferred For example, methyl transferases transfer one- 

carbon methyl groups, amino transferases transfer nitrogenous amino groups, and similarly 
denominated enzymes transfer aldehyde or ketone, acyl, glycosyl, alkyl or aryl, isoprenyl, saccharyl, 
phosphorous-containing, sulfur-containing, or selenium-containing groups, as well as small 
enzymatic groups such as Coenzyme A 
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Acyl transferases include peroxisomal carnitine octanoyl transferase, which is involved in the 
fatty acid beta-oxidation pathway, and mitochondrial carnitine palmitoyl transferases, involved in 
fatty acid metabolism and transport Choline O-acetyl transferase catalyzes the biosynthesis of the 
neurotransmitter acetylcholine. 
5 Amino transferases play key roles in protein synthesis and degradation, and they contribute to 

other processes as well. For example, the amino transferase 5-aminolevulinic acid synthase catalyzes 
the addition of succinyl-CoA to glycine, the first step in heme biosynthesis. Other amino transferases 
participate in pathways important for neurological function and metabolism. For example, glutamine- 
phenylpyruvate amino transferase, also known as glutamine transaminase K (GTK), catalyzes several 

1 o reactions with a pyridoxal phosphate cof actor. GTK catalyzes the reversible conversion of L- 

glutamine and phenylpyruvate to 2-oxoglutaramate and L-phenylalanine. Other amino acid substrates 
for GTK include L-methioiiine, L-histidine, andL-tyrosinc. GTK also catalyzes the conversion of 
kynurenine to kynurenic acid, a tryptophan metabolite that is an antagonist of the N-methyl-D- 
* aspartate (NMDA) receptor in the brain and may exert a neuromodulatory function. Alteration of the 

1 5 kynurenine metabolic pathway may be associated with several neurological disorders. GTK also 
plays a role in the metabolism of halogenated xenobiotics conjugated to glutathione, leading to 
nephrotoxicity in rats and neurotoxicity in humans. GTK is expressed in kidney, liver, and brain. 
Both human and rat GTKs contain a putative pyridoxal phosphate binding site (ExPASy ENZYME: 
EC 2.6.1.64; Perry, S.J. et al. (1993) Mol. Pharmacol. 43:660-665; Perry, S. et al. (1995) FEBS Lett 

20 360:277-280; and Alberati-Giani, D. et al. (1995) J. Neurochem. 64:1448-1455). A second amino 
transferase associated with this pathway is kynurenine/a-aminoadipate amino transferase (AadAT). 
AadAT catalyzes the reversible conversion of oc-aminoadipate and a-ketoglutarate to cc-ketoadipate 
and L-glutamate during lysine metabolism. AadAT also catalyzes the transamination of kynurenine 
to kynurenic acid. A cytosolic AadAT is expressed in rat kidney, liver, and brain (Nakatani, Y. et al. 

25 (1970) Biochim. Biophys. Acta 198:219-228; Buchli, R. et al. (1995) J. Biol. Chem. 270:29330- 
29335). 

Glycosyl transferases include the mammalian UDP-glucouronosyl transferases, a family of 
membrane-bound microsomal enzymes catalyzing the transfer of glucouronic acid to lipophilic 
substrates in reactions that play important roles in detoxification and excretion of drugs, carcinogens, 
30 and other foreign substances. Another mammalian glycosyl transferase, mammalian UDP-galactose- 
ceramide galactosyl transferase, catalyzes the transfer of galactose to ceramide in the synthesis of 
galactocerebrosides in myelin membranes of the nervous system. The UDP-glycosyl transferases 
share a conserved signature domain of about 50 amino acid residues (PROSITE: PDOC00359, 
http://expasyJicugexh/sprot/prositehtml), 
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Methyl transferases are involved in a variety of pharmacologically important processes. 
Nicotinamide N-methyl transferase catalyzes the N-methylation of nicotinamides and other pyridines, 
an important step in the cellular handling of drugs and other foreign compounds. 
Phenylethanolamine N-methyl transferase catalyzes the conversion of noradrenalin to adrenalin. 6-0- 
5 methylguanine-DNA methyl transferase reverses DNA methylation, an important step in 

carcinogenesis. Uroporphyrin-in C-methyl transferase, which catalyzes the transfer of two methyl 
groups from S-adenosyl-L-methionine to uroporphyrinogen HI, is the first specific enzyme in the 
biosynthesis of cobalamin, a dietary enzyme whose uptake is deficient in pernicious anemia, Protein- 
arginine methyl transferases catalyze the postradiational methylation of arginine residues in 

10 proteins, resulting in the mono- and dimethylation of arginine on the guanidino group. Substrates 
include histones, myelin basic protein, and heterogeneous nuclear ribonucleoproteins involved in 
mRNA processing, splicing, and transport Protein-argMne methyl transferase interacts with proteins 
upregulated by mitogens, with proteins involved in chronic lymphocytic leukemia, and with 
interferon, suggesting an important role for methylation in cytokine receptor signaling (Lin, W.J. et 

is al. (1996) J. Biol. Chem. 271:15034-15044; Abramovich, C. et aL (1997) EMBO J. 16:260-266; and 
Scott, H.S. et al. (1998) Genomics 48:330-340). 

Phosphotransferases catalyze the transfer of high-energy phosphate groups and are important 
in energy-requiring and -releasing reactions. The metabolic enzyme creatine kinase catalyzes the 
reversible phosphate transfer between creatine/creatine phosphate and ATP/ADP. Glycocyamine 

2 o kinase catalyzes phosphate transfer from ATP to guanidoacetate, and arginine kinase catalyzes 

phosphate transfer from ATP to arginine. A cysteine-containing active site is conserved in this family 
(PROSITE:PDOC00103). 

Prenyl transferases are heterodimers, consisting of an alpha and a beta subunit, that catalyze 
the transfer of an isoprenyl group. An example of a prenyl transferase is the mammalian protein 

2 5 f arnesyl transferase. The alpha subunit of f arnesyl transferase consists of 5 repeats of 34 amino acids 
each, with each repeat containing an invariant tryptophan (PROSITE: PDOC00703). 

Saccharyl transferases are glycating enzymes involved in a variety of metabolic processes. 
Oligosacchryl transferase-48, for example, is a receptor for advanced glycation endproducts. 
Accumulation of these endproducts is observed in vascular complications of diabetes, macrovascular 

30 disease, renal insufficiency, and Alzheimer's disease (Thornalley, P.J. (1998) Cell Mol. Biol. (Noisy- 
Le-Grand) 44:1013-1023). 

Coenzyme A (CoA) transferase catalyzes the transfer of CoA between two carboxylic acids. 
Succinyl CoA:3-oxoacid CoA transferase, for example, transfers CoA from succinyl-CoA to a 
recipient such as acetoacetate. Acetoacetate is essential to the metabolism of ketone bodies, which 
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accumulate in tissues affected by metabolic disorders such as diabetes (PROSITE: PDOC00980). 
Hydrolases 

Hydrolysis is the breaking of a covalent bond in a substrate by introduction of a molecule of 
water. The reaction involves a nucleophilic attack by the water molecule's oxygen atom on a target 
bond in the substrate. The water molecule is split across the target bond, breaking the bond and 
generating two product molecules. Hydrolases participate in reactions essential to such functions as 
synthesis and degradation of cell components, and for regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretioa Hydrolases are involved 
in key steps in disease processes involving these functions. Hydrolytic enzymes, or hydrolases, may 
be grouped by substrate specificity into classes including phosphatases, peptidases, 
lysophospholipases, phosphodiesterases, glycosidases, and glyoxalases. 

Phosphatases hydroiyticaily remove phosphate groups from proteins, an energy-providing 
step that regulates many cellular processes, including intracellular signaling pathways that in turn 
control cell growth and differentiation, cell-cell contact, the cell cycle, and oncogenesis. 

Lysophospholipases (LPLs) regulate intracellular lipids by catalyzing the hydrolysis of ester 
bonds to remove an acyl group, a key step in lipid degradation. Small LPL isofoims, approximately 
15-30 kD, function as hydrolases; larger isoforms function both as hydrolases and transacylases. A 
particular substrate for LPLs, lysophosphatidylcholine, causes lysis of cell membranes. LPL activity 
is regulated by signaling molecules important in numerous pathways, including the inflammatory 
response. 

Peptidases, also called proteases, cleave peptide bonds that form the backbone of peptide or 
protein chains. Proteolytic processing is essential to cell growth, differentiation, remodeling, and 
homeostasis as well as inflammation and immune response. Since typical protein half-lives range 
from hours to a few days, peptidases are continually cleaving precursor proteins to their active form, 
removing signal sequences from targeted proteins, and degrading aged or defective proteins. 
Peptidases function in bacterial, parasitic, and viral invasion and replication within a host. Examples 
of peptidases include trypsin and chymotrypsin (components of the complement cascade and the 
blood-clotting cascade) lysosomal cathepsins, calpains, pepsin, renin, and chymosin (Beynon, R. J. 
and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University Press, New 
York NY, pp. 1-5). 

The phosphodiesterases catalyze the hydrolysis of one of the two ester bonds in a 
phosphodiester compound. Phosphodiesterases are therefore crucial to a variety of cellular processes. 
Phosphodiesterases include DNA and RNA endo- and exo-nucleases, which are essential to cell 
growth and replication as well as protein synthesis. Another phosphodiesterase is acid 
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sphingomyelinase, wMch hydrolyzes tbe membrane phospholipid sphingomyelin to ceramide and 
phosphorylcholine. Phosphorylcholine is used in the synthesis of phosphatidylcholine, which is 
involved in numerous intracellular signaling pathways. Ceramide is an essential precursor for the 
generation of gangliosides, membrane lipids found in high concentration in neural tissue. Defective 
acid sphingomyelinase phosphodiesterase leads to a build-up of sphingomyelin molecules in 
lysosomes, resulting in Niemann-Pick disease. 

Glycosidases catalyze the cleavage of hemiacetyl bonds of glycosides, which are compounds 
that contain one or more sugar. Mammalian lactase-phlorizin hydrolase, for example, is an intestinal 
enzyme that splits lactose. Mammalian beta-galactosidase removes the terminal galactose from 
gangliosides, glycoproteins, and glycosaminoglycans, and deficiency of this enzyme is associated 
with a gangliosidosis known as Morquio disease type B. Vertebrate lysosomal alpha-glucosidase, 
which hydrolyzes glycogen, maltose, and isomaltose, and vertebrate intestinal sucrase-isomaltase, 
which hydrolyzes sucrose, maltose, and isomaltose, are widely distributed members of this family 
with highly conserved sequences at their active sites. 

The glyoxylase system is involved in gluconeogenesis, the production of glucose from 
storage compounds in the body. It consists of glyoxylase I, which catalyzes the formation of S-D- 
lactoylglutathione from methyglyoxal, a side product of triose-phosphate energy metabolism, and 
glyoxylase n, which hydrolyzes S-D-lactoylglutathione to D-lactic acid and reduced glutathione. 
Glyoxylases are involved in hyperglycemia, non-insulin-dependent diabetes mellitus, the 
detoxification of bacterial toxins, and in the control of cell proliferation and microtubule assembly. 
Lyases 

Lyases are a class of enzymes that catalyze the cleavage of C-C, C-O, C-N, C-S, C-(halide), 
P-0 or other bonds without hydrolysis or oxidation to f onn two molecules, at least one of which 
contains a double bond (Stryer, L. (1995) Biochemistry W.H. Freeman and Co. New York, NY 
p.620). Lyases are critical components of cellular biochemistry with roles in metabolic energy 
production including fatty acid metabolism, as well as other diverse enzymatic processes. Further 
classification of lyases reflects the type of bond cleaved as well as the nature of the cleaved group. 

The group of C-C lyases include carboxyl-lyases (decarboxylases), aldehyde-lyases 
(aldolases), oxo-acid-lyases and others. The C-0 lyase group includes hydro-lyases, lyases acting on 
polysaccharides and other lyases. The C-N lyase group includes ammonia-lyases, amidine-lyases, 
amine-lyases (deaminases) and other lyases. 

Proper regulation of lyases is critical to normal physiology. For example, mutation induced 
deficiencies in the uroporphyrinogen decarboxylase can lead to photosensitive cutaneous lesions in 
the genetically-linked disorder familial porphyria cutanea tarda (Mendez, M. et al. (1998) Am. J. 
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Genet 63:1363-1375). It has also been shown that adenosine deaminase (ADA) deficiency stems 
from genetic mutations in the ADA gene, resulting in the disorder severe combined 
immunodeficiency disease (SCID) (Hershfield, M.S. (1998) Semin. Hematol. 35:291-298). 
Isomerases 

5 Isomerases are a class of enzymes that catalyze geometric or structural changes within a 

molecule to form a single product This class includes racemases and epimerases, cis-trans- 
isomerases, intramolecular oxidoreductases, intramolecular transferases (mutases) and intramolecular 
lyases. Isomerases are critical components of cellular biochemistry with roles in metabolic energy 
production including glycolysis, as well as other diverse enzymatic processes (Stryer, L. (1995) 
10 Biochemistry , W.H. Freeman and Co., New York NY, pp.483-507) . 

Racemases are a subset of isomerases that catalyze inversion of a molecules configuration 
. around the asymmetric carbon atom in a substraie having a single center of asymmetry, thereby 
interconverting two racemers. Epimerases are another subset of isomerases that catalyze inversion of 
configuration around an asymmetric carbon atom in a substrate with more than one center of 
15 symmetry, thereby interconverting two epimers. Racemases and epimerases can act on amino acids 
and derivatives, hydroxy acids and derivatives, as well as carbohydrates and derivatives. The 
. interconversion of UDP-galactose and UDP-glucose is catalyzed by UDP-galactose-4' -epimerase. 
Proper regulation and function of this epimerase is essential to the synthesis of glycoproteins and 
glycolipids. Elevated blood galactose levels have been correlated with UDP-galactose-4' -epimerase 
20 deficiency in screening programs of infants (Gitzelmann, R. (1972) Helv. Paediat Acta 27:125-130). 

Oxidoreductases can be isomerases as well. Oxidoreductases catalyze the reversible transfer 
of electrons from a substrate that becomes oxidized to a substrate that becomes reduced. This class of 
enzymes includes dehydrogenases, hydroxylases, oxidases, oxygenases, peroxidases, and reductases. 
Proper maintenance of oxidoreductase levels is physiologically important For example, genetically- 

2 5 linked deficiencies in lipoamide dehydrogenase can result in lactic acidosis (Robinson, B.H. et al. 

(1977) Pediat Res. 11:1198-1202). 

Another subgroup of isomerases are the transferases (or mutases). Transferases transfer a 
chemical group from one compound (the donor) to another compound (the acceptor). The types of 
groups transferred by these enzymes include acyl groups, amino groups, phosphate groups 

3 o (phosphotransferases or phosphomutases), and others. Hie transferase carnitine palmitoyltransferase 

is an important component of fatty acid metabolism. Genetically-linked deficiencies in this 
transferase can lead to myopathy (Scriver, C.R. et al. (1995) Hie Metabolic and Molecular Basis of 
Inherited Disease . McGraw-Hill, New York NY, pp.1501-1533). 

Yet another subgroup of isomerases are the topoisomersases. Topoisomerases are enzymes 
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that affect the topological state of DNA For example, defects in topoisomerases or their regulation 
can affect normal physiology. Reduced levels of topoisomerase n have been correlated with some of 
the DNA processing defects associated with the disorder ataxia-telangiectasia (Singh, S.P. et al. 
(1988) Nucleic Acids Res. 16:3919-3929). 
5 Ligases 

Ligases catalyze the formation of a bond between two substrate molecules. The process 
involves the hydrolysis of a pyrophosphate bond in ATP or a similar energy donor. Ligases are 
classified based on the nature of the type of bond they form, which can include carbon-oxygen, 
carbon-sulfur, carbon-nitrogen, carbon-carbon and phosphoric ester bonds. 

1 o Ligases forming carbon-oxygen bonds include the aminoacyl-transf er RNA (tRNA) 

synthetases which are important RNA-associated enzymes with roles in translation. Protein 
biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. The 
aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA. The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 

15 structural classes, and each class is characterized by a distinctive topology of the catalytic domain. 
Class I enzymes contain a catalytic domain based on the nucleotide-binding Rossman fold. Class II 
enzymes contain a central catalytic domain, which consists of a seven-stranded antiparallel B-sheet 
motif, as well as N-<and C- terminal regulatory domains. Class II enzymes are separated into two 
groups based on the heterodimeric or homodimeric structure of the enzyme; the latter group is further 

20 subdivided by the structure of the N- and C-terminal regulatory domains (HaiHein, M. and S. Gusack 
(1995) J. Mol. Evol. 40:519-530). Autoantibodies against aminoacyl-tRNAs are generated by 
patients with dearmatomyositis and polymyositis, and correlate strongly with complicating interstitial 
lung disease (ILD). These antibodies appear to be generated in response to viral infection, and 
coxsackie virus has been used to induce experimental viral myositis in animals. 

2 5 Ligases forming carbon-sulfur bonds (Acid-thiol ligases) mediate a large number of cellular 

biosynthetic intermediary metabolism processes involve intermolecular transfer of carbon 
atom-containing substrates (carbon substrates). Examples of such reactions include the tricarboxylic 
acid cycle, synthesis of fatty acids and long-chain phospholipids, synthesis of alcohols and aldehydes, 
synthesis of intermediary metabolites, and reactions involved in the amino acid degradation 

3 o pathways. Some of these reactions require input of energy, usually in the form of conversion of ATP 

to either ADP or AMP and pyrophosphate. 

In many cases, a carbon substrate is derived from a small molecule containing at least two 
carbon atoms. The carbon substrate is often covalently bound to a larger molecule which acts as a 
carbon substrate carrier molecule within the cell. In the biosynthetic mechanisms described above, 
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the carrier molecule is coenzyme A. Coenzyme A (CoA) is structurally related to derivatives of the 
nucleotide ADP and consists of 4'-phosphopantetheine linked via a phosphodiester bond to the alpha 
phosphate group of adenosine 3',5'-bisphosphate. The terminal thiol group of 4'-phosphopantetheine 
acts as the site for carbon substrate bond formation. The predominant carbon substrates which utilize 
CoA as a carrier molecule during biosynthesis and intermediary metabolism in the cell are acetyl, 
succinyl, and propionyl moieties, collectively referred to as acyl groups. Other carbon substrates 
include enoyl lipid, which acts as a fatty acid oxidation intermediate, and carnitine, which acts as an 
acetyl-CoA flux regulator/ mitochondrial acyl group transfer protein. Acyl-CoA and acetyl-CoA are 
synthesized in the cell by acyl-CoA synthetase and acetyl-CoA synthetase, respectively. 

Activation of fatty adds is mediated by at least three forms of acyl-CoA synthetase activity: 
i) acetyl-CoA synthetase, which activates acetate and several other low molecular weight carboxylic 
acids and is found in muscle mitochondria and the cytosoi of other tissues; ii) medium-chain 
acyl-CoA synthetase, which activates fatty acids containing between four and eleven carbon atoms 
(predominantly from dietary sources), and is present only in liver mitochondria; and iii) acyl CoA 
synthetase, which is specific for long chain fatty acids with between six and twenty carbon atoms, and 
is found in microsomes and the mitochondria. Proteins associated with acyl-CoA synthetase activity 
have been identified from many sources including bacteria, yeast, plants, mouse, and man. The 
activity of acyl-CoA synthetase may be modulated by phosphorylation of the enzyme by 
cAMP-dependent protein kinase. 

Ligases forming carbon-nitrogen bonds include amide synthases such as glutamine synthetase 
(glutamate-ammonia ligase) that catalyzes the animation of glutamic acid to glutamine by ammonia 
using the energy of ATP hydrolysis. Glutamine is the primary source for the amino group in various 
amide transfer reactions involved in de novo pyrimidine nucleotide synthesis and in purine and 
pyrimidine ribonucleotide interconversions. Overexpression of glutamine synthetase has* been 
observed in primary liver cancer (Christa, L. et al. (1994) Gastroent 106:1312-1320). 

Acid-amino-acid ligases (peptide synthases) are represented by the ubiquitin proteases which 
are associated with the ubiquitin conjugation system (UCS), a major pathway for the degradation of 
cellular proteins in eukaryotic cells and some bacteria. The UCS mediates the elimination of 
abnormal proteins and regulates the half-lives of important regulatory proteins that control cellular 
processes such as gene transcription and cell cycle progression. In the UCS pathway, proteins 
targeted for degradation are conjugated to a ubiquitin (Ub), a small heat stable protein. Ub is first 
activated by a ubiquitin-activating enzyme (El), and then transferred to one of several Ub- 
conjugating enzymes (E2). E2 then links the Ub molecule through its C-terminal glycine to an 
internal lysine (acceptor lysine) of a target protein. The ubiquitinated protein is then recognized and 
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degraded by proteasome, a large, multisubunit proteolytic enzyme complex, and ubiquitin is released 
for reutilization by ubiquitin protease. The UCS is implicated in the degradation of mitotic cyclic 
kinases, oncoproteins, tumor suppressor genes such as p53, viral proteins, cell surface receptors 
associated with signal transduction, transcriptional regulators, and mutated or damaged proteins 
(Ciechanover, A. (1994) Cell 79:13-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin 
protease whose overexpression leads to oncogenic transformation of NIH3T3 cells, and the human 
homolog of this gene is consistently elevated in small cell tumors and adenocarcinomas of the lung 
(Gray, D.A. (1995) Oncogene 10:2179-2183). 

Cyclo-ligases and other carbon-nitrogen ligases comprise various enzymes and enzyme 
complexes that participate in the de novo pathways to purine and pyrimidine biosynthesis. Because 
these pathways are critical to the synthesis of nucleotides for replication of both RNA and DNA, 
many of these enzymes have been the targets of clinical agents for the treatment of ceil proliferative 
disorders such as cancer and infectious diseases. 

Purine biosynthesis occurs de novo from the amino acids glycine and glutamine, and other 
small molecules. Three of the key reactions in this process are catalyzed by a Afunctional enzyme 
composed of glycinamide-ribonucleotide synthetase (GARS), aminoimidazole ribonucleotide 
synthetase (AIRS), and glycinamide ribonucleotide transformylase (GART). Together these three 
enzymes combine ribosylamine phosphate with glycine to yield phosphoribosyl aminoimidazole, a 
precursor to both adenylate and guanylate nucleotides. This triftmctional protein has been implicated 
in the pathology of Downs syndrome (Aimi, J. et al. (1990) Nucleic Acid Res. 1 8:6665-6672). 
Adenylosuccinate synthetase catalyzes a later step in purine biosynthesis that converts inosinic acid to 
adenylosuccinate, a key step on the path to ATP synthesis. This enzyme is also similar to another 
carbon-nitrogen ligase, argininosuccinate synthetase, that catalyzes a similar reaction in the urea cycle 
(Powell, S.M. et al. (1992) FEBS Lett 303:4-10). 

Like the de novo biosynthesis of purines, de novo synthesis of the pyrimidine nucleotides 
uridylate and cytidylate also arises from a common precursor, in this instance the nucleotide 
orotidylate derived from orotate and phosphoribosyl pyrophosphate (PPRP). Again a trifunctional 
enzyme comprising three carbon-nitrogen ligases plays a key role in the process. In this case the 
enzymes aspartate transcarbamylase (ATCase), carbamyl phosphate synthetase II, and dihydroorotase 
(DHOase) are encoded by a single gene called CAD. Together these three enzymes combine the 
initial reactants in pyrimidine biosynthesis, glutamine, CO^ and ATP to form dihydroorotate, the 
precursor to orotate and orotidylate (Iwahana, H. et al. (1996) Biochem. Biophys. Res. Commun. 
219:249-255). Further steps then lead to the synthesis of uridine nucleotides from orotidylate. 
Cytidine nucleotides are derived from uridines-triphosphate (UTP) by the amidation of UTP using 
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glutamine as the amino donor and the enzyme CTP synthetase. Regulatory mutations in the human 
CTP synthetase are believed to confer multi-drug resistance to agents widely used in cancer therapy 
(Yamauchi, M. et al. (1990) EMBO J. 9:2095-2099). 

Ligases forming carbon-carbon bonds include the carboxylases acetyl-CoA carboxylase and 
5 pyruvate carboxylase. Acetyl-CoA carboxylase catalyzes the carboxylation of acetyl-CoA from CO^ 
and RJO using the energy of ATP hydrolysis. Acetyl-CoA carboxylase is the rate-limiting step in the 
biogenesis of long-chain fatty acids. Two isofonns of acetyl-CoA carboxylase, types I and types n, 
are expressed in human in a tissue-specific manner (Ha, J. et al. (1994) Eur. J. Biochem. 219:297- 
306); Pyruvate carboxylase is a nuclear-encoded mitochondrial enzyme that catalyzes the conversion 

10 of pyruvate to oxaloacetate, a key intermediate in the citric acid cycle. 

Ligases forming phosphoric ester bonds include the DNA ligases involved in both DNA 
replication and repair. DNA ligases seal phosphodiester bonds between two adjacent nucleotides in a 
DNA chain using the energy from ATP hydrolysis to first activate the free 5'-phosphate of one 
nucleotide and then react it with the 3'-OH group of the adjacent nucleotide. This reselling reaction 

is is used in both DNA replication to join small DNA fragments called Okazaki fragments that are 
transiently formed in the process of replicating new DNA, and in DNA repair. DNA repair is the 
process by which accidental base changes, such as those produced by oxidative damage, hydrolytic 
attack, or uncontrolled methylation of DNA, are corrected before replication or transcription of the 
DNA can occur. Bloom's syndrome is an inherited human disease in which individuals are partially 

2 o deficient in DNA ligation and consequently have an increased incidence of cancer (Alberts, B . et al. 

(1994) The Molecular Biology of the Cell , Garland Publishing Inc., New York NY, p. 247). 

Molecules Associated with Growth and Development 

Human growth and development requires the spatial and temporal regulation of cell 
25 differentiation, cell proliferation, and apoptosis. These processes coordinately control reproduction, 
aging, embryogenesis, morphogenesis, organogenesis, and tissue repair and maintenance. At the 
cellular level, growth and development is governed by the cell's decision to enter into or exit from the 
cell division cycle and by the cell's commitment to a terminally differentiated state. These decisions 
are made by the cell in response to extracellular signals and other environmental cues it receives. The 

3 o following discussion focuses on the molecular mechanisms of cell division, reproduction, cell 

differentiation and proliferation, apoptosis, and aging. 
Cell Division 

Cell division is the fundamental process by which all living things grow and reproduce. In 
unicellular organisms such as yeast and bacteria, each cell division doubles the number of organisms, 
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while in multicellular species many rounds of cell division are required to replace cefls lost by wear or 
by programmed cell death, and for cell differentiation to produce a new tissue or organ. Details of the 
cell division cycle may vary, but the basic process consists of three principle events. Hie first event, 
interphase, involves preparations for cell division, replication of the DNA, and production of essential 
5 proteins. In the second event, mitosis, the nuclear material is divided and separates to opposite sides of 
the cell. The final event, cytokinesis, is division and fission of the cell cytoplasm Hie sequence and 
timing of cell cycle transitions is under the control of the cell cycle regulation system which controls the 
process by positive or negative regulatory circuits at various check points. 

Regulated progression of the cell cycle depends on the integration of growth control pathways 

1 o with the basic cell cycle machinery. Cell cycle regulators have been identified by selecting for human 

and yeast cDN As that block or activate cell cycle arrest signals in the yeast mating pheromone pathway 
when ihey are overexpressed. Known regulators include human CPR (cell cycle progression 
restoration) genes, such as CPR8 and CPR2, and yeast CDC (cefl division control) genes, including 
CDC91, that block the arrest signals. The CPR genes express a variety of proteins including cyclins, 

15 tumor suppressor binding proteins, chaperones, transcription factors, translation factors, and 
RNA-binding proteins (Edwards, M.C. et al.(1997) Genetics 147:1063-1076). 

Several cell cycle transitions, including the entry and exit of a cell from mitosis, are dependent 
upon the activation and inhibition of cyclin-dependent kinases (Cdks). The Cdks are composed of a 
kinase subunit, Cdk, and an activating subunit, cyclin, in a complex that is subject to many levels of 

20 regulatioa There appears to be a single Cdk in Saccharomvces cerevisiae and Saccharomvces pombe 
whereas mammals have a variety of specialized Cdks. Cyclins act by binding to and activating 
cyclin-dependent protein kinases which then phosphorylate and activate selected proteins involved in the 
mitotic process. The Cdk-cyclin complex is both positively and negatively regulated by 
phosphorylation, and by targeted degradation involving molecules such as CDC4 and CDC53. In 

2 5 addition, Cdks are further regulated by binding to inhibitors and other proteins such as Sucl that 

modify their specificity or accessibility to regulators (Patra, D. and W.G. Dunphy (1996) Genes Dev. 

10:1503-1515; and Mathias, N. et al. (1996) Mol. Ce*l Biol. 16:6634-6643). 

Reproduction 

The male and female reproductive systems are complex and involve many aspects of growth 
30 and development The anatomy and physiology of the male and female reproductive systems are 
reviewed in (Guyton, A,C. (1991) Textbook of Medical Physiology . W.B. Saunders Co., Philadelphia 
PA, pp. 899-928). 

The male reproductive system includes the process of spermatogenesis, in which the sperm are 
formed, and male reproductive functions are regulated by various hormones and their effects on 
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accessory sexual organs, cellular metabolism, growth, and other bodily functions. 

Spermatogenesis begins at puberty as a result of stimulation by gonadotropic hormones 
released from the anterior pituitary. Immature sperm (spermatogonia) undergo several mitotic cell 
divisions before undergoing meiosis and full maturation. The testes secrete several male sex hormones, 
5 the most abundant being testosterone, that is essential for growth and division of the immature spam, 
and for the masculine characteristics of the male body. Three other male sex hormones, gonadotropin- 
releasing hormone (GnRH), luteinizing hormone (LH), and follicle-stimulating hormone (FSH) control 
sexual function. 

The uterus, ovaries, fallopian tubes, vagina, and breasts comprise the female reproductive 

1 o system. The ovaries and uterus are the source of ova and the location of fetal development, 

respectively. The fallopian tubes and vagina are accessory organs attached to the top and bottom of the 
uterus, respectively. Both the uterus and ovaries have additional roles in the development and loss of 
reproductive capability during a female's lifetime. The primary role of the breasts is lactation. 
Multiple endocrine signals from the ovaries, uterus, pituitary, hypothalamus, adrenal glands, and other 
1 5 tissues coordinate reproduction and lactation. These signals vary during the monthly menstruation 
cycle and during the female's lifetime. Similarly, the sensitivity of reproductive organs to these 
endocrine signals varies during the female's lifetime. 

A combination of positive and negative feedback to the ovaries, pituitary and hypothalamus 
glands controls physiologic changes during the monthly ovulation and endometrial cycles. The anterior 

2 o pituitary secretes two major gonadotropin hormones, follicle-stimulating hormone (FSH) and luteinizing 

hormone (LH), regulated by negative feedback of steroids, most notably by ovarian estradiol. If 
fertilization does not occur, estrogen and progesterone levels decrease. This sudden reduction of the 
ovarian hormones leads to menstruation, the desquamation of the endometrium. 

Hormones further govern all the steps of pregnancy, parturition, lactation, and menopause. 

2 5 During pregnancy large quantities of human chorionic gonadotropin (hCG), estrogens, progesterone, 

and human chorionic somatomammotropin (hCS) are formed by the placenta. hCG, a glycoprotein 
similar to luteinizing hormone, stimulates the corpus luteum to continue producing more progesterone 
and estrogens, rather than to involute as occurs if the ovum is not fertilized. hCS is similar to growth 
hormone and is crucial for fetal nutrition. 

3 o The female breast also matures during pregnancy. Large amounts of estrogen secreted by the 

placenta trigger growth and branching of the breast milk ductal system while lactation is initiated by the 
secretion of prolactin by the pituitary gland. 

Parturition involves several hormonal changes that increase uterine contractility toward the end 
of pregnancy, as follows. The levels of estrogens increase more than those of progesterone. Oxytocin 

18 



WO 01/62927 



PCT/US01/06059 



is secreted by the neurohypophysis. Concomitantly, uterine sensitivity to oxytocin increases. The fetus 
itself secretes oxytocin, Cortisol (from adrenal glands), and prostaglandins. 

Menopause occurs when most of the ovarian follicles have degenerated. The ovary then 
produces less estradiol, reducing the negative feedback on the pituitary and hypothalamus glands. 
Mean levels of circulating FSH and LH increase, even as ovulatory cycles continue. Therefore, the 
ovary is less responsive to gonadotropins, and there is an increase in the time between menstrual cycles. 
Consequently, menstrual bleeding ceases and reproductive capability ends. 
Cell Differentiation and Proliferation 

Tissue growth involves complex and or (toed patterns of cell proliferation, cell differentiation, 
and apoptosis. Cell proliferation must be regulated to maintain both the number of cells and their 
spatial organizatioa This regulation depends upon the appropriate expression of proteins which control 
cell cycle progression in response to extracellular signals, such as growth factors and other mitogens, 
and intracellular cues, such as DNA damage or nutrient starvation. Molecules which directly or 
indirectly modulate cell cycle progression fall into several categories, including growth factors and their 
receptors, second messenger and signal transduction proteins, oncogene products, tumor-suppressor 
proteins, and mitosis-promoting factors. < 

Growth factors were originally described as serum factors required to promote cell 
proliferation. Most growth factors are large, secreted polypeptides that act on cells in their local 
environment Growth factors bind to and activate specific ceil surface receptors and initiate 
intracellular signal transduction cascades. Many growth factor receptors are classified as receptor 
tyrosine kinases which undergo autophosphorylation upon ligand binding. Autophosphorylation 
enables the receptor to interact with signal transduction proteins characterized by the presence of SH2 
or SH3 domains (Src homology regions 2 or 3). These proteins then modulate the activity state of 
small G-proteins, such as Ras, Rab, and Rho, along with GTPase activating proteins (GAPs), guanine 
nucleotide releasing proteins (GNRPs), and other guanine nucleotide exchange factors. Small G 
proteins act as molecular switches that activate other downstream events, such as mitogen-activated 
protein kinase (MAP kinase) cascades. MAP kinases ultimately activate transcription of mitosis- 
promoting genes. 

In addition to growth factors, small signaling peptides and hormones also influence cell 
proliferation. These molecules bind primarily to another class of receptor, the trimeric G-protein 
coupled receptor (GPCR), found predominantly on the surface of immune, neuronal and neuroendocrine 
cells. Upon ligand binding, the GPCR activates a trimeric G protean which in turn triggers increased 
levels of intracellular second messengers such as phospholipase C, Ca2+, and cyclic AMP. Most 
GPCR-mediated signaling pathways indirectly promote cell proliferation by causing the secretion or 
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breakdown of other signaling molecules that have direct mitogenic effects. These signaling cascades 
often involve activation of kinases and phosphatases. Some growth factors, such as some members of 
the transforming growth factor beta (TGF-P) family, act on some cells to stimulate cell proliferation 
and on other cells to inhibit it Growth factors may also stimulate a cell at one concentration and inhibit 
5 the same cell at another concentratioa Most growth factors also have a multitude of other actions 
besides the regulation of cell growth and division: they can control the proliferation, survival, 
differentiation, migration, or function of cells depending on the circumstance. For example, the tumor 
necrosis factor/nerve growth factor (TNF/NGF) family can activate or inhibit cell death, as well as 
regulate proliferation and diff erentiatioiL The cell response depends on the type of cell, its stage of 

10 differentiation and transformation status, which surface receptors are stimulated, and the types of 

stimuli acting on the cell (Smith, A. et al. (1994) Cell 76:959-962; and Nocentini, G. et al. (1997) Proc. 
Natl. Acad Sci. USA 94:6216-6221). 

Neighboring cells in a tissue compete for growth factors, and when provided with "unlimited" 
quantities in a perfused system will grow to even higher cell densities before reaching density-dependent 

15 inhibition of cell divisioa Cells often demonstrate an anchorage dependence of cell division as well. 
This anchorage dependence may be associated with the formation of focal contacts linking the 
cytoskeleton with the extracellular matrix (ECM). The expression of ECM components can be 
stimulated by growth factors. For example, TGF-p stimulates fibroblasts to produce a variety of ECM 
proteins, including fibronectin, collagen, andtenascin (Pearson, C.A. et al. (1988) EMBO J. 7:2677- 

2 o 298 1). In fact, for some cell types specific ECM molecules, such as laminin or fibronectin, may act as 
growth factors. Tenascin-C and -R, expressed in developing and lesioned neural tissue, provide 
stimulatory/anti-adhesive or inhibitory properties, respectively, for axonal growth (Faissner, A (1997) 
Cell Tissue Res. 290:331-341). 

Cancers are associated with the activation of oncogenes which are derived from normal cellular 

25 genes. These oncogenes encode oncoproteins which convert normal cells into malignant cells. Some 
oncoproteins are mutant isoforms of the normal protein, and other oncoproteins are abnormally 
expressed with respect to location or amount of expressioa The latter category of oncoprotein causes 
cancer by altering transcriptional control of cell proliferatioa Five classes of oncoproteins are known 
to affect cell cycle controls. These classes include growth factors, growth factor receptors, intracellular 

' 3 o signal transducers, nuclear transcription factors, and cell-cycle control proteins. Viral oncogenes are 
integrated into the human genome after infection of human cells by certain viruses. Examples of viral 
oncogenes include v-src, v-abl, and v-ips. 

Many oncogenes have been identified and characterized. Hiese include sis, erbA, erbB, her-2, 
mutated G s , src, abl, ras, crk, jun, fos, myc, and mutated tumor-suppressor genes such as RB, p53, 
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mdm2, Cipl , pi 6, and cyclin D. Transformatioii of normal genes to oncogenes may also occur by 
chromosomal translocatioa The Philadelphia chromosome, characteristic of chronic myeloid leukemia 
and a subset of acute lymphoblastic leukemias, results from a reciprocal translocation between 
chromosomes 9 and 22 that moves a truncated portion of the proto-oncogene c-abl to the breakpoint 
5 cluster region (bcr) on chromosome 22. 

Tumor-suppressor genes are involved in regulating cell proliferation. Mutations which cause 
reduced or loss of function in tumor-suppressor genes result in uncontrolled cell proliferation. For 
example, the retinoblastoma gene product (RB), in a non-phosphorylated state, binds several early- 
response genes and suppresses their transcription, thus blocking cell divisioa Phosphorylation of RB 
10 causes it to dissociate from the genes, releasing the suppression, and allowing cell division to proceed. 
Apoptosis 

Apoptosis is the genetically controlled process by which unneeded or defective cells undergo 
programmed cell death. Selective elimination of cells is as important for morphogenesis and tissue 
remodeling as is cell proliferation and differentiation. Lack of apoptosis may result in hyperplasia and 
1 5 other disorders associated with increased cell proliferation. Apoptosis is also a critical component of 
the immune response. Immune cells such as cytotoxic T-cells and natural 10110- cells prevent the spread 
of disease by inducing apoptosis in tumor cells and virus-infected cells. In addition, immune cells that 
fail to distinguish self molecules from foreign molecules must be eliminated by apoptosis to avoid an 
autoimmune response. 

2 o Apqptotic cells undergo distinct morphological changes. Hallmarks of apoptosis include cell 

shrinkage, nuclear and cytoplasmic condensation, and alterations in plasma membrane topology. 
Biochemically, apoptotic cells are characterized by increased intracellular calcium concentration, 
fragmentation of chromosomal DNA, and expression of novel cell surface components. 

The molecular mechanisms of apoptosis are highly conserved, and many of the key protein 

2 5 regulators and effectors of apoptosis have been identified. Apoptosis generally proceeds in response to 

a signal which is transduced intracellularly and results in altered patterns of gene expression and protein 
activity. Signaling molecules such as hormones and cytokines are known both to stimulate and to 
inhibit apoptosis through interactions with cell surface receptors. Transcription factors also play an 
important role in the onset of apoptosis. A number of downstream effector molecules, particularly 

3 o proteases such as the cysteine proteases called caspases, have been implicated in the degradation of 

cellular components and the proteolytic activation of other apoptotic effectors. 
Aging and Senescence 

Studies of the aging process or senescence have shown a number of characteristic cellular and 
molecular changes (Fauci et al. (1998) Harrison's Principles of Internal Medicine . McGraw-Hill, New 
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York NY, p.37). These characteristics include increases in chromosome structural abnormalities, DNA 
cross-linking, incidence of single-stranded breaks in DNA, losses in DNA methylation, and degradation 
of telomere regions. In addition to these DNA changes, post-translational alterations of proteins 
increase including, deamidation, oxidation, cross-linking, and nonenzymatic glycation. Still further 
5 molecular changes occur in the mitochondria of aging cells through deterioration of structure. These 
changes eventually contribute to decreased function in every organ of the body. 

Biochemical Pathway Molecules 

Biochemical pathways are responsible for regulating metabolism, growth and development, 
10 protein secretion and trafficking, environmental responses, and ecological interactions including 
immune response and response to parasites. 
DNA replication 

Deoxyribonucleic acid (DNA), the genetic material, is found in both the nucleus and 
mitochondria of human cells. The bulk of human DNA is nuclear, in the form of linear chromosomes, 

1 5 while mitochondrial DNA is circular. DNA replication begins at specific sites called origins of 

replicatioa Bidirectional synthesis occurs from the origin via two growing forks that move in opposite 
directions. Replication is semi-conservative, with each daughter duplex containing one old strand and 
its newly synthesized complementary partner. Proteins involved in DNA replication include DNA 
polymerases, DNAprimase, telomerase, DNAhelicase, topoisomerases, DNAligases, replication 

2 o factors, and DNA-binding proteins. 
DNA Recombination and Repair 

Cells are constantly faced with replication errors and environmental assault (such as ultraviolet 
irradiation) that can produce DNA damage. Damage to DNA consists of any change that modifies the 
structure of the molecule. Changes to DNA can be divided into two general classes, single base 

2 5 changes and structural distortions. Any damage to DNA can produce a mutation, and the mutation may 

produce a disorder, such as cancer. 

Changes in DNA are recognized by repair systems within the cell. These repair systems act to 
correct the damage and thus prevent any deleterious affects of a mutational event. Repair systems can 
be divided into three general types, direct repair, excision repair, and retrieval systems. Proteins 

3 o involved in DNA repair include DNA polymerase, excision repair proteins, excision and cross link 

repair proteins, recombination and repair proteins, RAD51 proteins, and BLN and WRN proteins that 
are homologs of RecQ helicase. When the repair systems are eliminated, cells become exceedingly 
sensitive to environmental mutagens, such as ultraviolet irradiation. Patients with disorders associated 
with a loss in DNA repair systems often exhibit a high sensitivity to environmental mutagens. 
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Examples of such disorders include xeroderma pigmentosum (XP), Bloom's syndrome (BS), and 
Winer's syndrome (WS) (Yamagata, K. et aL (1998) Proc. Natl. Acad. Sci. USA 95:8733-8738), 
ataxia telangiectasia, Cockayne's syndrome, andFanconi's anemia. 

Recombination is the process whereby new DNA sequences are generated by the movements of 
5 large pieces of DNA. In homologous recombination, which occurs during meiosis and DNA repair, 
parent DNA duplexes align at regions of sequence similarity, and new DNA molecules form by the 
breakage and joining of homologous segments. Proteins involved include RAD5 1 recombinase. In site- 
specific recombination, two specific but not necessarily homologous DNA sequences are exchanged. In 
the immune system this process generates a diverse collection of antibody and T cell receptor genes. 

1 o Proteins involved in site-specific recombination in the immune system include recombination activating 

genes 1 and 2 (RAG1 and RAG2). A defect in immune system site-specific recombination causes 
severe combined immunodeficiency disease in mice. 
RNA Metabolism 

Ribonucleic acid (RNA) is a linear single-stranded polymer of four nucleotides, ATP, CTP, 
is OTP, and GTP. In most organisms, RNA is transcribed as a copy of DNA, the genetic material of the 
organism In retroviruses RNA rather than DNA serves as the genetic material. RNA copies of the 
genetic material encode proteins or serve various structural, catalytic, or regulatory roles in organisms. 
RNA is classified according to its cellular localization and function. Messenger RNAs (mRNAs) 
encode polypeptides. Ribosomal RNAs (rRNAs) are assembled, along withribosomal proteins, into 

2 o ribosomes, which are cytoplasmic particles that translate mRNA into polypeptides. Transfer RNAs 

(tRNAs) are cytosolic adaptor molecules that function in mRNA translation by recognizing both an 
mRNA codon and the amino acid that matches that codoa Heterogeneous nuclear RNAs (hnRNAs) 
include mRNA precursors and other nuclear RNAs of various sizes. Small nuclear RNAs (snRNAs) 
are a part of the nuclear spliceosome complex that removes intervening, non-coding sequences (introns) 
25 and rejoins exons in pre-mRNAs. 
RNA Transcription 

The transcription process synthesizes an RNA copy of DNA. Proteins involved include multi- 
subunit RNA polymerases, transcription factors HA, IIB, ED, IIE, HF, EH, and ID. Many 
transcription factors incorporate DNA-binding structural motifs which comprise either cc-helices or 0- 

3 o sheets that bind to the major groove of DNA. Four well-characterized structural motifs are helix-turn- 

helix, zinc finger, leucine zipper, and helix-loop-helix. 
RNA Processing 

Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre-mRNA 
processing steps include capping at the 5 ! end with methylguanosine* polyadenylating the 3' end, and 
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splicing to remove introns. The spliceosomal complex is comprised of five small nuclear 
riboiiucleoprotein particles (snRNPs) designated Ul t U2, U4, U5, and U6. Each snRNP contains a 
single species of snRNA and about ten proteins. Hie RNA components of some snRNPs recognize and 
base-pair with intron consensus sequences. The protein components mediate spliceosome assembly and 
the splicing reaction. Autoantibodies to snRNP proteins are found in the blood of patients with 
systemic lupus erythematosus (Stryer, L. (1995) Biochemistry W.H. Freeman and Company, New 
York NY, p. 863), 

Heterogeneous nuclear ribonucleoprotdns (hnRNPs) have been identified that have roles in 
splicing, exporting of the mature RNAs to the cytoplasm, and mRNA translation (Biamonti, G. et al. 
(1998) Clin. Exp. Rheumatol. 16:317-326). Some examples of hnRNPs include the yeast proteins 
Hrplp, involved in cleavage and polyadenylation at the 3' end of the RNA; Cbp80p, involved in 
capping the 5* end of the RNA.; and Npl3p, a homolog of mammalian hnRNP Al, involved in export of 
mRNA from the nucleus (Shen, EC. et al. (1998) Genes Dev. 12:679-691). HnRNPs have been shown 
to be important targets of the autoimmune response in rheumatic diseases (Biamonti, supra) . 

Many snRNP proteins, hnRNP proteins, and alternative splicing factors are characterized by 
an RNA recognition motif (RRM). (Reviewed in Birney, E. et al. (1993) Nucleic Acids Res. 21:5803- 
5816.) The RRM is about 80 amino acids in length and forms four P-strands and two a-helices 
arranged in an a/p sandwich The RRM contains a core RNP-1 octapeptide motif along with 
surrounding conserved sequences. 
RNA Stability and Degradation 

RNA helicases alto and regulate RNA conformation and secondary structure by using energy 
derived from ATP hydrolysis to destabilize and unwind RNA duplexes. The most well-characterized 
and ubiquitous family of RNA helicases is the DEAD-box family, so named for the conserved B-type 
ATP-binding motif which is diagnostic of proteins in this family. Over 40 DEAD-box helicases have 
been identified in organisms as diverse as bacteria, insects, yeast, amphibians, mammals, and plants. 
DEAD-box helicases function in diverse processes such as translation initiation, splicing, ribosome 
assembly, and RNA editing, transport, and stability. Some DEAD-box helicases play tissue- and stage- 
specific roles in spermatogenesis and embryogenesis. (Reviewed in Linder, P. et al. (1989) Nature 
337:121-122.) 

Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the progression of 
neuroblastoma (Nb) and retinoblastoma (Rb) tumors. Other DEAD-box helicases have been implicated 
either directly or indirectly in ultraviolet light-induced tumors, B cell lymphoma, and myeloid 
malignancies. (Reviewed in Godbout, R. et al. (1998) J. Biol. Chem. 273:21161-21168.) 

Ribomicleases (RNases) catalyze the hydrolysis of phosphodiester bonds in RNA chains, thus 
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cleaving the RNA. For example, RNase P is a ribonucleoprotein enzyme which cleaves the 5 * end of 
pre-tRNAs as part of their maturation process. RNase H digests the RNA strand of an RNA/DNA 
hybrid Such hybrids occur in cells invaded by retroviruses, and RNase H is an important enzyme in 
the retroviral replication cycle. RNase H domains are often found as a domain associated with reverse 
5 transcriptases. RNase activity in serum and cell extracts is elevated in a variety of cancers and 

infectious diseases (Schein, C.H. (1997) Nat Biotechnol. 15:529-536). Regulation of RNase activity is 
being investigated as a means to control tumor angiogenesis, allergic reactions, viral infection and 
replication, and fungal infections. 
Protein Translation 

10 The eukaryotic ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, 

which together form the 80S ribosome. In addition to the 1 8S, 28S, 5S, and 5.8S rRNAs, the ribosome 
also contains mere than fifty proteins. The ribosomai proteins have a prefix which denotes the subunit 
to which they belong, either L (large) or S (small). Three important sites are identified on the ribosome. 
The aminoacyl-tRNA site (A site) is where charged tRNAs (with the exception of the initiator-tRNA) 

1 5 bind on arrival at the ribosome, The peptidyl-tRNA site (P site) is where new peptide bonds are 
formed, as well as where the initiator tRNA binds. The exit site (E site) is whore deacylated tRNAs 
bind prior to their release from the ribosomew (Translation is reviewed in Stryer, L. (1995) 
Biochemistry, W.H. Freeman and Company, New York NY, pp. 875-908; andLodish, H. et al. (1995) 
Molecular Cell Biology , Scientific American Books, New York NY, pp. 1 19-138.) 

20 tRNA Charging 

Protein biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. 
The aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA. The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 
structural classes, Class I and Class II. Autoantibodies against aminoacyl-tRNAs are generated by 

2 5 patients with dermatomyositis and polymyositis, and correlate strongly with complicating interstitial 

lung disease (ILD). 4 These antibodies appear to be generated in response to viral infection, and 
coxsackie virus has been used to induce experimental viral myositis in animals. 
Translation Initiation 

Initiation of translation can be divided into three stages. The first stage brings an initiator 

3 o transfer RNA (Met-tRNA^) together with the 40S ribosomai subunit to form the 43S preanitiation 

complex. The second stage binds the 43S preinitiation complex to the mRNA, followed by migration of 
the complex to the correct AUG initiation codon. The third stage brings the 60S ribosomai subunit to 
the 40S subunit to generate an 80S ribosome at the initiation codon. Regulation of translation primarily 
involves the first and second stage in the initiation process (Pain, V.M. (1996) Eur. J. Biochem. 
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236:747-771). 

Several initiation factors, many of which contain multiple subunits, are involved in bringing an 
initiator tRNA and 40S ribosomal subunit together. eIF2, a guanine nucleotide binding protein, recruits 
the initiator tRNA to the 40S ribosomal subunit Only when eIF2 is bound to GTP does it associate 
5 with the initiator tRNA. eIF2B, a guanine nucleotide exchange protein, is responsible for converting 
eIF2 from the GDP-bound inactive form to the GTP-bound active form. Two other factors, elFIA and 
eBF3 bind and stabilize the 40S subunit by interacting with 18S ribosomal RNA and specific ribosomal 
structural proteins. eIF3 is also involved in association of the 40S ribosomal subunit with mRNA. The 
Met-tRNAf, eBFl A, eIF3, and 40S ribosomal subunit together make up the 43S preinitiation complex 
10 (Pain, supra) . 

Additional factors are required for binding of the 43S preinitiation complex to an mRNA 
molecule, and the process is regulated at several levels. eiF4F is a complex consisting of three proteins: 
erF4E, eIF4A, and eIF4G. eIF4E recognizes and binds to the mRNA 5 -terminal m 7 GTP cap, eIF4A is 
a bidirectional RNA-dependent helicase, and eBF4G is a scaffolding polypeptide. eEF4G has three 

1 5 binding domains. The N-terminal third of eIF4G interacts with eIF4E, the central third interacts with 
eEF4A, and the C-terminal third interacts with eIF3 bound to the 43S preinitiation complex. Thus, 
eEF4G acts as a bridge between the 40S ribosomal subunit and the mRNA (Hentze, M.W. (1997) 
Science 275:500-501). 

The ability of eIF4F to initiate binding of the 43S preinitiation complex is regulated by 

2 o structural features of the mRNA. The mRNA molecule has an untranslated region (UTR) between the 
5 ' cap and the AUG start codon. In some mRNAs this region forms secondary structures that impede 
binding of the 43S preinitiation complex. The helicase activity of eIF4A is thought to function in 
removing this secondary structure to facilitate binding of the 43S preinitiation complex (Pain, supra) . 
Translation Elongation 

2 5 Elongation is the process whereby additional amino acids are joined to the initiator methionine 

to form the complete polypeptide chain. The elongation factors EF1 a, EFlp y, and EF2 are involved 
in elongating the polypeptide chain following initiation. EFla is a GTP-binding protein. InEFla's 
GTP-bound form, it brings an aminoacyl-tRNA to the ribosome's A site. The amino acid attached to 
the newly airived aminoacyl-tRNA forms a peptide bond with the initiator methionine. The GTP on 

30 EFlaishydrolyzedtoGDP, andEFla-GDP dissociates from the ribosome. EFlp y binds EFla -GDP 
and induces the dissociation of GDP from EFla, allowing EFla to bind GTP and a new cycle to begia 

As subsequent aminoacyl-tRNAs are brought to the ribosome, EF-G, another GTP-binding 
protein, catalyzes the translocation of tRN As from the A site to the P site and finally to the E site of the 
ribosome. This allows the processivity of translation. 
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Translation Termination 

The release factor eRF carries out termination of translation eRF recognizes stop codons in 
the mRNA, leading to the release of the polypeptide chain from the ribosome. 
Post-Translational Pathways 

Proteins may be modified after translation by the addition of phosphate, sugar, prenyl, fatty 
acid, and other chemical groups. These modifications are often required for proper protein activity. 
Enzymes involved in post-translational modification include kinases, phosphatases, 
gjycosyltransferases, and prenyltransferases. The conformation of proteins may also be modified after 
translation by the introduction and rearrangement of disulfide bonds (rearrangement catalyzed by 
protein disulfide isomerase), the isomerization of proline sidechains by prolyl isomerase, and by 
interactions with molecular chaperone protdns. 

Proteins may also be cleaved by proteases. Such cleavage may result in activation, 
inactivation, or complete degradation of the protein Proteases include serine proteases, cysteine 
proteases, aspartic proteases, and metalloproteases. Signal peptidase in the endoplasmic reticulum 
(ER) lumen cleaves the signal peptide from membrane or secretory proteins that are imported into the 
ER. Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria. The UCS 
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the 
UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable 
protein. Proteins involved in Ihe UCS include ubiquitin-activating enzyme, ubiquitin-conjugating 
enzymes, ubiquitin-ligases, and ubiquitin C-terminal hydrolases. The ubiquitinated protein is then 
recognized and degraded by the proteasome, a large, multisubunit proteolytic enzyme complex, and 
ubiquitin is released for reutilization by ubiquitin protease. 
Lipid Metabolism 

Lipids are water-insoluble, oily or greasy substances that are soluble in nonpolar solvents such 
as chloroform or ether. Neutral fats (triacylglycerols) serve as major fuels and energy stores. Polar 
lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components 
of cell membranes. 

Lipid metabolism is involved in human diseases and disorders. In the arterial disease 
atherosclerosis, fatty lesions form on the inside of the arterial wall. These lesions promote the loss of 
arterial flexibility and the formation of blood clots (Guyton, A.C. Textbook of Medical Physiology 
(1991) W.B. Saunders Company, Philadelphia PA, pp.760-763). InTay-Sachs disease, the GM 2 
ganglioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack of the 
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enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early death 
(Fauci, AS. et al. (1998) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY, p. 
2171). The Niemann-Pick diseases are caused by defects in lipid metabolism. Niemann-Pick diseases 
types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids in the 
central nervous system due to a defect in the enzyme sphingomyelinase, leading to neurodegeneration 
and lung disease. Niemann-Pick disease type C results from a defect in cholesterol transport, leading to 
the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary reduction in 
sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and loss of 
previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which 
contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease 
typeC (Fauci, supra , p. 2175; Loftus. S.K. et al. (1997) Science 277:232-235). (Lipid metabolism is 
reviewed in Stryer, L. (1995) Biochemistry. W.H. Freeman and Company, New York NY; Lehninger, 
A. (1982) Principles of Biochemistry Worth Publishers, Inc., New York NY; and ExPASy 
"Biochemical Pathways" index of Boehringer Mannheim World Wide Web site.) 
Fatty Acid Synthesis 

Fatty acids are long-chain organic acids with a single carboxyl group and a long non-polar 
hydrocarbon tail. Long-chain fatty acids are essential components of glycolipids, phospholipids; and 
cholesterol, which are building blocks for biological membranes, and of triglycerides, which are 
biological fuel molecules. Long-chain fatty acids are also substrates for eicosanoid production, and are 
important in the functional modification of certain complex carbohydrates and proteins. 1 6-carbon and 
18-carbon fatty acids are the most common. 

Fatty acid synthesis occurs in the cytoplasm. In the first step, acetyl-Coenzyme A (CoA) 
carboxylase (ACC) synthesizes malonyl-CoA from acetyl-CoA and bicarbonate. The enzymes which 
catalyze the remaining reactions are covalently linked into a single polypeptide chain, referred to as the 
multifunctional enzyme fatty acid synthase (FAS). FAS catalyzes the synthesis of palmitate from 
acetyl-CoA and malonyl-CoA FAS contains acetyl transferase, malonyl transferase, p-ketoacetyl 
synthase, acyl carrier protein, 0-ketoacyl reductase, dehydratase, enoyl reductase* and thioesterase 
activities. The final product of the FAS reaction is the 1 6-carbon fatty acid palmitate. Further 
elongation, as well as unsaturation, of palmitate by accessory enzymes of the ER produces the variety 
of long chain fatty acids required by the individual cell. These enzymes include a NADH-cytochrome 
b 5 reductase, cytochrome b 5 , and a desaturase. 
Phospholipid and Triacvlglvcerol Synthesis 

Triacylglycerols, also known as triglycerides and neutral fats, are major energy stores in 
animals. Triacylglycerols are esters of glycerol with three fatty acid chains. Glycerol-3-phosphate is 
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produced from dihydroxyacetone phosphate by the enzyme glycerol phosphate dehydrogenase or from 
glycerol by glycerol kinase. Fatty acid-CoA's are produced from fatty acids by fatty acyl-CoA 
synthetases. Glyercol-3-phosphate is acylated with two fatty acyl-CoA's by the enzyme glycerol 
phosphate acyltransferase to give phosphatidate. Phosphatidate phosphatase converts phosphatidate to 
diacylglycerol, which is subsequently acylated to a triacylglyercol by the enzyme diglyceride 
acyltransferase. Phosphatidate phosphatase and diglyceride acyltransferase form a triacylglyerol 
synthetase complex bound to the ER membrane. 

A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol 
backbone, two fatty acid chains, and a phosphorylated alcohol. Phosphoglycerides are components of 
cell membranes. Principal phosphoglycerides are phosphatidyl choline, phosphatidyl ethanolamine, 
phosphatidyl serine, phosphatidyl inositol, and diphosphatidyl glycerol. Many enzymes involved in 
phosphoglyceride synthesis are associated with membranes (Meyers, R.A. ( 1995). Molecular Biology 
and Biotechnology . VCH Publishers Inc., New YorkNY, pp. 494-501). Phosphatidate is converted to 
CDP-diacylglycerol by the enzyme phosphatidate cytidylyltransferase (ExPASy ENZYME EC 
2.7.7.41). Transfer of the diacylglycerol group from CDP-diacylglycerol to serine to yield phosphatidyl 
serine, or to inositol to yield phosphatidyl inositol, is catalyzed by the enzymes CDP-diacylglycerol- 
. serine O-phosphatidyltransferase and CDP-diacylglycerol-inositol 3-phosphatidyltransferase, .. 
respectively (ExPASy ENZYME EC 2.7.8.8; ExPASy ENZYME EC 2.7.8.1 1). The enzyme 
phosphatidyl serine decarboxylase catalyzes the conversion of phosphatidyl serine to phosphatidyl 
ethanolamine, using a pyruvate cof actor (Voelker, D.R. (1997) Biochim. Biophys. Acta 1348:236-244). 
Phosphatidyl choline is formed using diet-derived choline by the reaction of CDP-choline with 1,2- 
diacylglycerol, catalyzed by diacylglycerol cholinephosphotransferase (ExPASy ENZYME 2.7.8.2). 
Sterol. Steroid and Isoprenoid Metabolism 

Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates 
the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis of 
steroid hormones such as Cortisol, progesterone, estrogen, and testosterone. Bile salts derived from 
cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess 
water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from 
cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins 
such as ras and protein-targeting proteins such as rab. These modifications are important for the 
activities of these proteins (Guyton, supra ; Stryer, supra, pp. 279-280, 691-702, 934). 

Mammals obtain cholesterol derived from both denovo biosynthesis and the diet The liver is 
the major site of cholesterol biosynthesis in mammals. Two acetyl-CoA molecules initially condense to 
form acetoacetyl-CoA, catalyzed by a thiolase. Acetoacetyl-CoA condenses with a third acetyl-CoA to 
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form hydroxymethylgiutaryl-CoA (HMG-CoA), catalyzed by HMG-CoA synthase. Conversion of 
HMG-CoA to cholesterol is accomplished via a sales of enzymatic steps known as the mevalonate 
pathway. The rat&limiting step is the conversion of HMG-CoA to mevalonate by HMG-CoA 
reductase. The drug lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to 
reduce their serum cholesterol levels. Other mevalonate pathway enzymes include mevalonate kinase, 
phosphomevalonate kinase, diphosphomevalonate decarboxylase, isopentenyldiphosphate isomerase, 
dimethylallyl transferase, geranyl transferase, farnesyl-diphosphate farnesyltransferase, squalene 
monooxygenase, lanosterol synthase, lathosterol oxidase, and 7-dehydr ©cholesterol reductase. 

Cholesterol is used in the synthesis of steroid hormones such as Cortisol, progesterone, 
aldosterone, estrogen, and testosterone. First, cholesterol is converted to pregnenolone by cholesterol 
monooxygeaases. The other steroid hormones are synthesized from pregnenolone by a series of 
enzyme-catalyzed reactions including oxidations, isomerizations, hydroxylations, reductions, and 
demethylations. Examples of these enzymes include steroid A-isomerase, 3p-hydroxy-A 5 -steroid 
dehydrogenase, steroid 21 -monooxygenase, steroid 19-hydroxylase, and 3f$-hydroxysteroid 
dehydrogenase. Cholesterol is also the precursor to vitamin D. 

Numerous compounds contain 5 -carbon isoprene units derived from the mevalonate pathway 
intermediate isopentenyl pyrophosphate. Isoprenoid groups are found in vitamin K, ubiquinone, retinal, 
dolichol phosphate (a carrier of oligosaccharides needed for N-linked glycosylation), and farnesyl and 
geranylgeranyl groups that modify proteins. Enzymes involved include farnesyl transferase, polyprenyl 
transferases, dolichyl phosphatase, and dolichyl kinase. 
Sphingolipid Metabolism 

Sphingolipids are an important class of membrane lipids that contain sphingosine, a long chain 
amino alcohol. They are composed of one long-chain fatty acid, one polar head alcohol, and 
sphingosine or sphingosine derivative. The three classes of sphingolipids are sphingomyelins, 
cerebrosides, and gangliosides. Sphingomyelins, which contain phosphocholine or 
phosphoethanolamine as their head group, are abundant in the myelin sheath surrounding nerve cells. 
Galactocerebrosides, which contain a glucose or galactose head group, are characteristic of the brain. 
Other cerebrosides are found in nonneural tissues. Gangliosides, whose head groups contain multiple 
sugar units, are abundant in the brain, but are also found in nonneural tissues. 

Sphingolipids are built on a sphingosine backbone. Sphingosine is acylated to ceramide by the 
enzyme sphingosine acetyltransferase. Ceramide and phosphatidyl choline are converted to 
sphingomyelin by the enzyme ceramide choline phosphotransferase. Cerebrosides are synthesized by 
the linkage of glucose or galactose to ceramide by a transferase. Sequential addition of sugar residues 
to ceramide by transferase enzymes yields gangliosides. 
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Eicosanoid Metabolism 

Eicosanoids, including prostaglandins, prostacyclin, thromboxanes, and leukotrienes, are 20- 
carbon molecules derived from fatty acids. Eicosanoids are signaling molecules which have roles in 
pain, fever, and inflammation. The precursor of all eicosanoids is arachidonate, which is generated 
from phospholipids by phospholipase A^ and from diacylglycerols by diacylglycerol lipase, 
Leukotrienes are produced from arachidonate by the action of lipoxygenases. Prostaglandin synthase, 
reductases, and isomerases are responsible for the synthesis of the prostaglandins. Prostaglandins have 
roles in inflammation, blood flow, ion transport, synaptic transmission, and sleep. Prostacyclin and the 
thromboxanes are derived from a precursor prostaglandin by the action of prostacyclin synthase and 
thromboxane synthases, respectively. 
Ketone Body Metabolism 

Pairs of acetyl-CoA molecules derived from fatty acid oxidation in the liver can condense to 
form acetoacetyl-CoA, which subsequently forms acetoacetate, D-3-hydroxybutyrate, and acetone. 
These three products are known as ketone bodies. Enzymes involved in ketone body metabolism 
include HMG-CoA synthetase, HMG-CoA cleavage enzyme, D-3-hydroxybutyrate dehydrogenase, 
acetoacetate decarboxylase* and 3-ketoacyl-CoA transferase. Ketone bodies are a normal fuel supply 
of the heart and renal cortex. Acetoacetate produced by the liver is transported to cells where the 
acetoacetate is converted back to acetyl-CoA and enters the citric acid cycle. In times of starvation, 
ketone bodies produced from stored triacylglyerols become an important fuel source, especially for the 
brain. Abnormally high levels of ketone bodies are observed in diabetics. Diabetic coma can result if 
ketone body levels become too great. 
Lipid Mobilization 

Within cells, fatty acids are transported by cytoplasmic fatty acid binding proteins (Online 
Mendelian Inheritance in Man (OMIM) *1 34650 Fatty Acid-Binding Protein 1, Liver; FABP1). 
Diazepam binding inhibitor (DBI), also known as endozepine and acyl CoA-binding protein, is an 
endogenous y-aminobutyric acid (GAB A) receptor ligand which is thought to down-regulate the effects 
of GAB A. DBI binds medium- and long-chain acyl-CoA esters with very high affinity and may 
function as an intracellular carrier of acyl-CoA esters (OMIM *125950 Diazepam Binding Inhibitor; 
DBI; PROSITE PDOC00686 Acyl-CoA-binding protein signature). 

Fat stored in liver and adipose triglycerides may be released by hydrolysis and transported in 
the blood. Free fatty acids are transported in the blood by albumin. Triacylglycerols and cholesterol 
esters in the blood are transported in lipoprotein particles. The particles consist of a core of 
hydrophobic lipids surrounded by a shell of polar lipids and apolipoprotetns. The protein components 
serve in the solubilization of hydrophobic lipids and also contain cell-targeting signals. Lipoproteins 
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include chylomicrons, chylomicron remnants, very-low-density lipoproteins (VLDL), intermediate- 
density lipoproteins (IDL), low-density lipoproteins (LDL), and high-density lipoproteins (HDL). 
There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary 
heart disease. 

5 Triacylglycerols in chylomicrons and VLDL are hydrolyzed by lipoprotein lipases that line 

blood vessels in muscle and other tissues that use fatty acids. Cell surface LDL receptors bind LDL 
particles which are thai internalized by endocytosis. Absence of the LDL receptor, the cause of the 
disease familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to 
atherosclerosis. Plasma cholesteryl ester transfer protein mediates the transfer of cholesteryl esters 

10 from HDL to apolipoprotein B-containing lipoproteins. Cholesteryl ester transfer protein is important 
in the reverse cholesterol transport system and may play a role in atherosclerosis (Y amashita, S. et al. 
(1997) Cum Opin. Lipidol. 8:101-110). Macrophage scavenger receptors, which bind and internalize 
modified lipoproteins, play a role in lipid transport and may contribute to atherosclerosis (Greaves, 
D.R. et al. (1998) Curr. Opin. Lipidol. 9:425-432). 

15 Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to 

cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol-responsive 
transcription factor. Under normal cholesterol conditions, SREBP resides in the ER membrane. When 
cholesterol levels are low, a regulated cleavage of SREBP occurs which releases the extracellular 
domain of the protein. This cleaved domain is then transported to the nucleus where it activates the 

2 o transcription of the LDL receptor gene, and genes encoding enzymes of cholesterol synthesis, by 

binding the sterol regulatory element (SRE) upstream of the genes (Y ang, J. et al. (1995) J. Biol. Chem. 
270:12152-12161). Regulation of cholesterol uptake and biosynthesis also occurs via the oxysterol- 
binding protein (OSBP). OSBP is a high-affinity intracellular receptor for a variety of oxysterols that 
down-regulate cholesterol synthesis and stimulate cholesterol esterification (Lagace> T.A. et al. (1997) 
25 Biochem. J. 326:205-213). 
Beta-oxidation 

Mitochondrial and peroxisomal beta-oxidation enzymes degrade saturated and unsaturated fatty 
acids by sequential removal of two-carbon units from CoA-activated fatty acids. The main beta- 
oxidation pathway degrades both saturated and unsaturated fatty acids while the auxiliary pathway 

3 o performs additional steps required for the degradation of unsaturated fatty acids. 

The pathways of mitochondrial and peroxisomal beta-oxidation use similar enzymes, but have 
different substrate specificities and functions. Mitochondria oxidize short-, medium-, and long-chain 
fatty acids to produce energy for cells. Mitochondrial beta-oxidation is a major energy source for 
cardiac and skeletal muscle. In liver, it provides ketone bodies to the peripheral circulation when 
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glucose levels are low as in starvation, endurance exorcise, and diabetes (Eaton, S. et al. (1996) 
Biochem. J. 320:345-357). Peroxisomes oxidize medium-, long-, and very-long-chain fatty acids, 
dicarboxylic fatty adds, branched fatty acids, prostaglandins, xenobiotics, and bile acid intermediates. 
The chief roles of peroxisomal beta-oxidation are to shorten toxic lipophilic carboxylic acids to 
5 facilitate their excretion and to shorten very-long-chain fatty acids prior to mitochondrial beta-oxidation 
(Mannaerts, G.P. andP.P. van Veldhoven (1993) Biochimie 75:147-158). 

Enzymes involved in beta-oxidation include acyl CoA synthetase, carnitine acyltransferase, 
acyl CoA dehydrogenases, enoyl CoAhydratases, L-3-hydrbxyacyl CoA dehydrogenase, P-ketothiolase, 
2,4-dienoyl CoA reductase, and isomerase. 

io Lipid Cleavage and Degradation 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Lysophospholipases 
(LPLs) are widely distributed enzymes that metabolize intracellular lipids, and occur in numerous 
isoforms. Small isofonns, approximately 15-30 kD, function as hydrolases; large isoforms, those 
exceeding 60 kD, function both as hydrolases and transacylases. A particular substrate for LPLs, 

15 lysophosphatidylcholine, causes lysis of cell membranes when it is formed or imported into a ceil. 
LPLs are regulated by lipid factors including acylcarnitine, arachidonic acid, and phosphatide acid. • 
These lipid factors are signaling molecules important in numerous pathways, including the 
inflammatory response. (Anderson, R. et al. (1994) Toxicol. Appl. Pharmacol. 125:176-183; Selle, H. 
et al. (1993); Eur. J. Biochem. 212:411-416.) 

2 o The secretory phospholipase A 2 (PLA2) superf amily comprises a number of heterogeneous 

enzymes whose common feature is to hydrolyze the sn-2 fatty acid acyl ester bond of 
phosphogjycerides. Hydrolysis of the glycerpphospholipids releases free fatty acids and 
lysophospholipids. PLA2 activity generates precursors for the biosynthesis of biologically active lipids, 
hydroxy fatty acids, and platdet-activating factor. PLA2 hydrolysis of the sn-2 ester bond in 

2 5 phospholipids generates free fatty acids, such as arachidonic acid and lysophospholipids. 

Carbon and Carbohydrate Metabolism 

Carbohydrates, including sugars or saccharides, starch, and cellulose, are aldehyde or ketone 
compounds with multiple hydroxyl groups. The importance of carbohydrate metabolism is 
demonstrated by the sensitive regulatory system in place for maintenance of blood glucose levels. Two 

3 o pancreatic hormones, insulin and glucagon, promote increased glucose uptake and storage by cells, and 

increased glucose release from cells, respectively. Carbohydrates have three important roles in 
mammalian cells. First, carbohydrates are used as energy stores, fuels, and metabolic intermediates. 
Carbohydrates are broken down to form energy in glycolysis and are stored as glycogen for later use. 
Second, the sugars deoxyribose and ribose form part of the structural support of DNA and RNA, 
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respectively. Third, carbohydrate modifications are added to secreted and membrane proteins and lipids 
as they traverse the secretory pathway. Cell surface carbohydrate-containing macromolecules, 
including glycoproteins, glycdlipids, and transmembrane proteoglycans, mediate adhesion with other 
cells and with components of the extracellular matrix. The extracellular matrix is comprised of diverse 
glycoproteins, glycosaminoglycans (GAGs), and carbohydrate-binding proteins which are secreted from 
the cell and assembled into an organized meshwork in close association with the cell surface. The 
interaction of the cell with the surrounding matrix profoundly influences cell shape, strength, flexibility, 
motility, and adhesioa These dynamic properties are intimately associated with signal transduction 
pathways controlling cell proliferation and differentiation, tissue construction, and embryonic 
development 

Carbohydrate metabolism is altered in several disorders including diabetes mdlitus, 
hyperglycemia, hypoglycemia, galactosemia, galactokinase deficiency, and UDP-galactose4-epimerase 
deficiency (Fauci, AS. et al. (1998) Harrison's Principles of Internal Medicine , McGraw-Hill, New 
York NY, pp. 2208-2209). Altered carbohydrate metabolism is associated with cancer. Reduced GAG 
and proteoglycan expression is associated with human lung carcinomas (Nackaerts, K. et al. (1997) Int. 
J. Cancer 74:335-345). The carbohydrate determinants sialyl Lewis A and sialyl Lewis X are 
frequently expressed on human cancer cells (Kannagi, R. (1997) Glycoconj. J. 14:577-584). . 
Alterations of the N-linked carbohydrate core structure of cell surface glycoproteins are linked to colon 
and pancreatic cancers (Schwarz, R.E. et al. (1996) Cancer Lett. 107:285-291). Reduced expression 
of the Sda blood group carbohydrate structure in cell surface glycolipids and glycoproteins is observed 
in gastrointestinal cancer (Dohi, T. et al. (1996) Int J. Cancer 67:626-663). (Carbon and 
carbohydrate metabolism is reviewed in Stryer, L. (1995) Biochemistry W.H. Freeman and Company, 
New York NY; Lehninger, AL. (1982) Principles of Biochemistry Worth Publishers Inc., New York 
NY; and Lodish, H. et al. (1995) Molecular Cell Biology Scientific American Books, New York NY.) 
Gl ycolysis 

Enzymes of the glycolytic pathway convert the sugar glucose to pyruvate while simultaneously 
producing ATP. The pathway also provides building blocks for the synthesis of cellular components 
such as long-chain fatty acids. After glycolysis, pyrvuate is converted to acetyl-Coenzyme A which, in 
aerobic organisms, enters the citric acid cycle Glycolytic enzymes include hexokinase, phosphoglucose 
isomerase, phosphofructokinase, aldolase, triose phosphate isomerase, glycer aldehyde 3-phosphate 
dehydrogenase, phosphoglycerate kinase, phosphoglyceronnitase, endlase, and pyruvate kinase. Of 
these, phosphofructokinase, hexokinase, and pyruvate kinase are important in regulating the rate of 
glycolysis. 
Gluconeogenesis 
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Gluconeogenesis is the synthesis of glucose from noncarbohydrate precursors such as lactate 
and amino acids. The pathway, which functions mainly in times of starvation and intense exercise, 
occurs mostly in the liver and kidney. Responsible enzymes include pyruvate carboxylase, 
phosphoenolpyruvate carboxykinase, fructose 1,6-bisphosphatase, and glucose-6-phosphatase. 
Pentose Phosphate Pathway 

Pentose phosphate pathway enzymes are responsible for generating the reducing agent 
NADPH, while at the same time oxidizing glucose-6-pbosphate to ribose-5-phosphate. Ribose-5- 
phosphate and its derivatives become part of important biological molecules such as ATP, Coenzyme 
A, NAD + , FAD, RNA, and DNA The pentose phosphate pathway has both oxidative and non- 
oxidative branches. The oxidative branch steps, which are catalyzed by the enzymes glucoses- 
phosphate dehydrogenase, lactonase, and 6-phosphogluconate dehydrogenase, convert glucoses- 
phosphate and NADP* to ribulose-6-phosphate and NABPH. The non-oxidative branch steps, which 
are catalyzed by the enzymes phosphopentose isomer ase, phosphqpentose epimerase, transketolase, and 
transaldolase, allow the interconversion of three-, four-, five-, six-, and seven-carbon sugars. 
Glucouronate Metabolism 

Glucuronate is a monosaccharide which, in the form of D-glucuronic acid, is found in the 
GAGs chondroitin and dennataa D-glucuronic acid is also important in the detoxification and 
excretion of foreign organic compounds such as phenol. Enzymes involved in glucuronate metabolism 
include UDP-glucose dehydrogenase and glucuronate reductase. 
Disaccharide Metabolism 

Disaccharides must be hydrolyzed to monosaccharides to be digested. Lactose* a disaccharide 
found in milk, is hydrolyzed to galactose and glucose by the enzyme lactase. Maltose is derived from 
plant starch and is hydrolyzed to glucose by the enzyme maltase. Sucrose is derived from plants and is 
hydrolyzed to glucose and fructose by the enzyme sucrase. Trehalose, a disaccharide found mainly in 
insects and mushrooms, is hydrolyzed to glucose by the enzyme trehalase (OMIM *275360 Trehalase; 
Ruf, J. et al. (1990) J. Biol. Chem 265:15034-15039). Lactase, maltase, sucrase, and trehalase are 
bound to mucosal cells lining the small intestine, where they participate in the digestion of dietary 
disaccharides. The enzyme lactose synthetase, composed of the catalytic subunit galactosyltransferase 
and the modifier subunit a-lactalbumin, converts UDP-galactose and glucose to lactose in the mammary 
gjands. 

Glycogen. Starch, and Chitin Metabolism 

Glycogen is the storage form of carbohydrates in mammals. Mobilization of glycogen 
maintains glucose levels between meals and during muscular activity. Glycogen is stored mainly in the 
liver and in skeletal muscle in the form of cytoplasmic granules. These granules contain enzymes that 



35 



WO 01/62927 



PCT/US01/06059 



catalyze the synthesis and degradation of glycogen, as weQ as enzymes that regulate these processes. 
Enzymes that catalyze the degradation of glycogen include glycogen phosphorylase, a transferase, a- 
1,6-gJucosidase, and phosphoglucomutase. Enzymes that catalyze the synthesis of glycogen include 
UDP-glucose pyrophosphatase, glycogen synthetase, a branching enzyme, and nucleoside 
5 diphosphokinase. The enzymes of glycogen synthesis and degradation are tightly regulated by the 
hormones insulin, glucagon, and epinephrine. Starch, a plant-derived polysaccharide* is hydrolyzed to 
maltose, maltotriose, and a-dextrin by a-amylase, an enzyme secreted by the salivary glands and 
pancreas. Chitin is a polysaccharide found in insects and Crustacea. A chitotriosidase is secreted by 
macrophages and may play a role in the degradation of chitin-containing pathogens (Boot, R.G. et al. 

10 (1995) J. Biol Chem. 270:26252-26256). 
Peptidoglycans and Glvcosaminoglvcans 

Glycosaminoglycans (GAGs) arc anionic linear unbranched polysaccharides composed of 
repetitive disaccharide units. These repetitive units contain a derivative of an amino sugar, either 
glucosamine or galactosamine. GAGs exist free or as part of proteoglycans, large molecules composed 

15 of a core protein attached to one or more GAGs. GAGs are found on the cell surface, inside cells, and 
in the extracellular matrix. Changes in GAG levels are associated with several autoimmune diseases 
including autoimmune thyroid disease, autoimmune diabetes mellitus, and systemic lupus 
erythematosus (Hansen, C. et al. (1996) Clin. Exp. Rheum. 14 (Suppl. 15):S59-S67). GAGs include 
chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan sulfate, and hyaluronan. 

2 o The GAG hyaluronan (HA) is found in the extracellular matrix of many cells, especially in soft 

connective tissues, and is abundant in synovial fluid (Pitsillides, A A. et al. (1993) Int. J. Exp. Pathol. 
74:27-34). HA seems to play important roles in cell regulation, development, and differentiation 
(Laurent, T.C. and J.R. Fraser (1992) FASEB J. 6:2397-2404). Hyaluronidase is an enzyme that 
degrades HA to oligosaccharides. Hyaluronidases may function in cell adhesion, infection, 
25 angiogenesis, signal transduction, reproduction, cancer, and inflammation 

Proteoglycans, also known as peptidoglycans, are found in the extracellular matrix of 
connective tissues such as cartilage and are essential for distributing the load in weight-bearing joints. 
Cell-surface-attached proteoglycans anchor ceils to the extracellular matrix. Both extracellular and 
cell-surface proteoglycans bind growth factors, facilitating their binding to cell-surface receptors and 

3 o subsequent triggering of signal transduction pathways. 

Amino Acid and Nitrogen Metabolism 

NH 4 * is assimilated into amino adds by the actions of two enzymes, glutamate 
dehydrogenase and glutamine synthetase. The carbon skeletons of amino acids come from the 
intermediates of glycolysis, the pentose phosphate pathway, or the citric acid cycle. Of the twenty 
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amino acids used in proteins, humans can synthesize only thirteen (nonessential amino acids). Hie 
remaining nine must come from the diet (essential amino acids). Enzymes involved in nonessential 
amino acid biosynthesis include glutamate kinase dehydrogenase, pyrroline carboxylate reductase, 
asparagine synthetase, phenylalanine oxygenase, methionine adenosyltransferase, 
adenosylhomocysteinase, cystathionine P-synthase, cystathionine y-lyase, phosphoglycerate 
dehydrogenase, phosphoserine transaminase, phosphoserine phosphatase, serine 
hydroxylmethyltransferase, and glycine synthase. 

Metabolism of amino acids takes place almost entirely in the liver, where the amino group is 
removed by aminotransferases (transaminases), for example, alanine aminotransferase. The amino 
group is transferred;to a-ketoglutarate to form glutamate. Glutamate dehydrogenase converts 
glutamate to NH 4 + and a-ketoglutarate. NH 4 + is converted to urea by the urea cycle which is 
catalyzed by the enzymes arginase ; ornithine transcarbamoylase, arginosuccinate synthetase, and 
arginosuccinase. Carbamoyl phosphate synthetase is also involved in urea formation. Enzymes 
involved in the metabolism of the carbon skeleton of amino acids include serine dehydratase, 
asparaginase, glutaminase, propionyl CoA carboxylase, methylmalonyl CoA mutase, branched-chain 
a-keto dehydrogenase complex, isovaleryl CoA dehydrogenase, P-methylcrotonyl CoA carboxylase, 
phenylalanine hydroxylase, p-hydroxylphenylpyruvate hydroxylase, and homogentisate oxidase. 

Polyamines, which include spermidine, putrescine, and spermine, bind tightly to nucleic acids 
and are abundant in rapidly proliferating cells. Enzymes involved in poly amine synthesis include 
ornithine decarboxylase. 

Diseases involved in amino acid and nitrogen metabolism include hyperammonemia, 
carbamoyl phosphate synthetase deficiency, urea cycle enzyme deficiencies, methylmalonic aciduria, 
maple syrup disease, alcaptonuria, and phenylketonuria. 
Energy Metabolism 

Cells derive energy from metabolism of ingested compounds that may be roughly categorized 
as carbohydrates, fats, or proteins. Energy is also stored in polymers such as triglycerides (fats) and 
glycogen (carbohydrates). Metabolism proceeds along separate reaction pathways connected by key 
intermediates such as acetyl coenzyme A (acetyl-CoA). Metabolic pathways feature anaerobic and 
aerobic degradation, coupled with the energy-requiring reactions such as phosphorylation of 
adenosine diphosphate (ADP) to the triphosphate (ATP) or analogous phosphorylations of guanosine 
(GDP/GTP), uridine (UDP/UTP), or cytidine (CDP/CTP). Subsequent dephosphorylation of the 
triphosphate drives reactions needed for cell maintenance, growth, and proliferation. 

Digestive enzymes convert carbohydrates and sugars to glucose; fructose and galactose are 
converted in the liver to glucose. Enzymes involved in these conversions include galactose- 1- 
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phosphate uridyl transferase and UDP-galactose-4 epimerase. In the cytoplasm, glycolysis converts 
glucose to pyruvate in a series of reactions coupled to ATP synthesis. 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric acid cycle,' involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 
OTP. In oxidative phosphorylation, the transport of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and ? { by the ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the F^F-, ATPase complex, ubiquinonc(GoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Glycerol is then 
phosphorylated to glycerol-3-phosphate by glycerol kinase and glycerol phosphate dehydrogenase, 
and degraded by the glycolysis. Fatty acids are transported into the mitochondria as fatty acyl- 
carnitine esters and undergo oxidative degradation. 

In addition to metabolic disorders such as diabetes and obesity, disorders of energy 
metabolism are associated with cancers (Dorward, A et al. (1997) J. Bioenerg. Biomembr. 29:385- 
392), autism (Lombard, J. (1998) Med. Hypotheses 50:497-500), neurodegenerative disorders (Alexi, 
T. et al. (1998) Neuroreport 9:R57-64), and neuromuscular disorders (DiMauro, S. et al. (1998) 
Biochim. Biophys. Acta 1366:199-210). The myocardium is heavily dependent on oxidative 
metabolism, so metabolic dysfunction often leads to heart disease (DiMauro, S. and M. Hirano (1998) 
Curr. Opin. Cardiol. 13:190-197). 

For a review of energy metabolism enzymes and intermediates, see Stryer, L. et al. (1995) 
Biochemistry , W.H. Freeman and Co., San Francisco CA, pp. 443-652. For a review of energy 
metabolism regulation, see Lodish, H. et al. (1995) Molecular Cell Biology. Scientific American 
Books, New York NY, pp. 744-770. 
Cof actor Metabolism 

Cofactors, including coenzymes and prosthetic groups, are small molecular weight inorganic 
or organic compounds that are required for the action of an enzyme. Many cofactors contain vitamins 
as a component. Cofactors include thiamine pyrophosphate, flavin adenine dinucleotide, flavin 
mononucleotide, nicotinamide adenine dinucleotide, pyridoxal phosphate, coenzyme A, 
tetrahydrofolate, lippamide, and heme. The vitamins biotin and cobalamin are associated with 
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enzymes as well. Heme, a prosthetic group found in myoglobin and hemoglobin, consists of 
protoporphyrin group bound to iron. Porphyrin groups contain four substituted pyrroles covalently 
joined in a ring, often with a bound metal atom. Enzymes involved in porphyrin synthesis include 5- 
aminolevulinate synthase, 5-aminolevulinate dehydrase, porphobilinogen deaminase, and cosynthase. 
5 Deficiencies in heme formation cause porphyrias. Heme is broken down as a part of erythrocyte 
turnover. Enzymes involved in heme degradation include heme oxygenase and biliverdin reductase. 

Iron is a required cof actor for many enzymes. Besides the heme-containing enzymes, iron is 
found in iron-sulfur clusters in proteins including aconitase, succinate dehydrogenase, and NADH-Q 
reductase. Iron is transported in the blood by the protein transferrin. Binding of transferrin to the 

l o transferrin receptor on cell surfaces allows uptake by receptor mediated endocytosis. Cytosolic iron is 
bound to ferritin protein. 

A molybdenum-containing cefaclor (molybdopterin) is found in enzymes including sulfite 
oxidase, xanthine dehydrogenase, and aldehyde oxidase. Molybdopterin biosynthesis is performed by 
two molybdenum cofactor synthesizing enzymes. Deficiencies in these enzymes cause mental 

15 retardation and lens dislocation. Other diseases caused by defects in cofactor metabolism include 
pernicious anemia and methylmalonic aciduria. 
Secretion and Trafficking 

Eukaryotic cells are bound by a lipid bilayer membrane and subdivided into functionally 
distinct, membrane bound compartments. The membranes maintain the essential differences between 

20 the cytosol, the extracellular environment, and the lumenal space of each intracellular organelle. As • 
lipid membranes are highly impermeable to most polar molecules, transport of essential nutrients, 
metabolic waste products, cell signaling molecules, macromolecules and proteins across lipid 
membranes and between organelles must be mediated by a variety of transport-associated molecules. 
Protein Trafficking 

25 In eukaryotes, some proteins are synthesized on ER-bound ribosomes, co-translationally 

imported into the ER, delivered from the ER to the Golgi complex for post-translational processing and 
sorting, and transported from the Golgi to specific intracellular and extracellular destinations. All cells 
possess a constitutive transport process which maintains homeostasis between the cell and its 
environment In many differentiated cell types, the basic machinery is modified to carry out specific 

30 transport functions. For example, in endocrine glands, hormones and other secreted proteins are 
packaged into secretory grannies for regulated exocytosis to the cell exterior. In macrophage, foreign 
extracellular material is engulfed (phagocytosis) and delivered to lysosomes for degradatioa In fat and 
muscle cells, glucose transporters are stored in vesicles which fuse with the plasma membrane only in 
response to insulin stimulation. 
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The Secretory Pathway 

Synthesis of most integral memhrane proteins, secreted proteins, and proteins destined for the 
lumen of a particular organelle occurs on ER-bound ribosomes. These proteins are co-translationally 
imported into the ER. The proteins leave the ER via membrane-bound vesicles which bud off the ER at 
5 specific sites and fiise with each other (homotypic fusion) to form the ER-Golgi Intermediate 

Compartment (ERGIC). The ERGIC matures progressively through the cis, medial, and trans asternal 
stacks of the Golgi, modifying the enzyme composition by retrograde transport of specific Golgi 
enzymes. In this way, proteins moving through the Golgi undergo post-translational modification, such 
as glycosylation. The final Golgi compartment is the Trans-Golgi Network (TGN), where both 

1 o membrane and lumenal proteins are sorted for their final destination Transport vesicles destined for 

intracellular compartments, such as the lysosome, bud off the TGN. What remains is a secretory 
vesicle which contains proteins destined for the plasma membrane, such as receptors, adhesion 
molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 
enzymes. Secretory vesicles eventually fuse with the plasma membrane (Glick, B.S. and V. Malhotra 
15 (1998) Cell 95:883-889). 

The secretory process can be constitutive or regulated Most cells have a constitutive pathway 
for secretion, whereby vesicles derived from maturation of the TGN require no specific signal to fuse 
with the plasma membrane. In many cells, such as endocrine cells, digestive cells, and neurons, vesicle 
pools derived from the TGN collect in the cytoplasm and do not fuse with the plasma membrane until 

2 o they are directed to by a specific signal. 

Endocvtosis 

Endocytosis, wherein cells internalize material from the extracellular environment, is essential 
for transmission of neuronal, metabolic, and proliferative signals; uptake of many essential nutrients; 
and defense against invading organisms. Most cells exhibit two forms of endocytosis. The first, 

2 5 phagocytosis, is an actin-driven process exemplified in macrophage and neutrophils. Material to be 

endocytosed contacts numerous cell surface receptors which stimulate the plasma membrane to extend 
and surround the particle, enclosing it in a membrane-bound phagosome. In the mammalian immune 
system, IgG-coated particles bind Fc receptors on the surface of phagocytic leukocytes. Activation of 
the Fc receptors initiates a signal cascade involving src-family cytosolic kinases and the monomeric 

3 o GTP-binding (G) protein Rho. The resulting actin reorganization leads to phagocytosis of the particle. 

This process is an important component of the humoral immune response, allowing the processing and 
presentation of bacterial-derived peptides to antigen-specific T-lymphocytes. 

The second form of endocytosis, pinocytosis, is a more generalized uptake of material from the 
external milieu. Like phagocytosis, pinocytosis is activated by ligand binding to cell surface receptors. 
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Activation of individual receptors stimulates an internal response that includes coalescence of the 
receptor-ligand complexes and formation of clathrin-coated pits. Invagination of the plasma membrane 
at clathrin-coated pits produces an endocytic vesicle within the cell cytoplasm. These vesicles undergo 
homotypic fusion to form an early autosomal (EE) compartment The tubulovesicular EE serves as a 
5 sorting site for incoming material. ATP-driven proton pumps in the EE membrane lowers the pH of the 
EE lumen (pH 6. 3-6.8). The acidic environment causes many ligands to dissociate from their receptors. 
The receptors, along with membrane and other integral membrane proteins, are recycled back to the 
plasma membrane by budding off the tubular extensions of the EE in recycling vesicles (RV). This 
selective removal of recycled components produces a carrier vesicle containing ligand and other 
10 material from the external environment. The carrier vesicle fuses with TGN-dearived vesicles which 
contain hydrolytic enzymes. The acidic environment of the resulting late endosome (LE) activates the 
hydrolytie enzymes which degrade the ligauus and other material. As digestion takes place, the LE 
fuses with the lysosome where digestion is completed (Mellman, I. (1996) Annu. Rev. Cell Dev. Biol. 
12:575-625). 

1 5 Recycling vesicles may return directly to the plasma membrane. Receptors internalized and 

returned directly to the plasma membrane have a turnover rate of 2-3 minutes. Some RVs undergo 
microtubule-directed relocation to a perinuclear site, from which they then return to the plasma 
membrane. Receptors following this route have a turnover rate of 5- 10 minutes. Still other RVs are 
retained within the cell until an appropriate signal is received (Mellman, supra ; and James, D.E. et al. 

20 (1994) Trends Cell Biol. 4:120-126). 
Vesicle Formation 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 
(tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN), the plasma membrane 

2 5 (PM), and tubular extensions of the endosomes. The process begins with the budding of a vesicle out of 
the donor membrane. The membrane-bound vesicle contains proteins to be transported and is 
surrounded by a protective coat made up of protein subunits recruited from the cytosol The initial 
budding and coating processes are controlled by a cytosolic ras-like GTP-binding protein, ADP- 
ribosylating factor (Arf), and adapter proteins (AP). Different isoforms of both Arf and AP are 

30 involved at different sites of budding. Another small G-protein, dynamin, forms a ring complex around 
the neck of the forming vesicle and may provide the mechanochemical force to accomplish the final step 
of the budding process. The coated vesicle complex is then transported through the cytosol. During the 
transport process, Arf-bound GTP is hydrolyzed to GDP and the coat dissociates from the transport 
vesicle (West, M. A et al. (1997) J. Cell Biol. 138:1239-1254). Two different classes of coat protein 
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have also been identified Clathrin coats form on the TGN and PM surfaces, whereas coatomer or COP 
coats form on the ER and Golgi. COP coats can further be distinguished as COPI, involved in 
retrograde traffic through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde 
traffic from the ER to the Golgi (Mellman, supra). The COP coat consists of two major components, a 
5 G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar complex of seven 
proteins, teamed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. (Harter, C. andF.T. 
Wiedand (1998) Proc. Natl. Acad. Sci. USA 95 : 1 1 649-1 1 654.) 
Membrane Fusion 

Transport vesicles undergo homotypic or heterotypic fusion in the secretory and endocytotic 

1 o pathways. Molecules required for appropriate targeting and fusion of vesicles with their target 

membrane include proteins incorporated in the vesicle membrane, the target membrane, and proteins 
recruited from the cytosol. During budding of the vesicle from the donor compartment, an integral 
membrane protein, VAMP (vesicle-associated membrane protein) is incorporated into the vesicle. Soon 
after the vesicle uncoats, a cytosolic prenylated GTP-binding protein, Rab (a member of the Ras 
1 5 superf amily), is inserted into the vesicle membrane. GTP-bound Rab proteins are directed into nascent 
transport vesicles where they interact with VAMP. Following vesicle transport, GTPase activating 
proteins (GAPs) in the target membrane convert Rab proteins to the GDP-bound form. A cytosolic 
protein, guaninernucleotide dissociation inhibitor (GDI) helps return GDP-bound Rab proteins to their 
membrane of origin. Several Rab isoforms have been identified and appear to associate with specific 

2 o compartments within the cell. Rab proteins appear to play a role in mediating the function of a viral 

gene, Rev, which is essential for replication of HTV-1, the virus responsible for AIDS (Flavell, RA et 
al (1996) Proc. Natl. Acad. Sci. USA 93:4421-4424). 

Docking of the transport vesicle with the target membrane involves the formation of a complex 
between the vesicle SNAP receptor (v-SNARE), target membrane (t~) SNAREs, and certain other 

2 5 membrane and cytosolic proteins. Many of these other proteins have been identified although their 

exact functions in the docking complex remain uncertain (Tellam, J.T. et al. (1995) J. Biol. Chem. 
270:5857-5863; and Hata, Y. andT.C. Sudhof (1995) J. Biol. Chem. 270:13022-13028). 
N-ethylmaleimide sensitive factor (NSF) and soluble NSF-attachment protein (a-SNAP and p-SNAP) 
are two such proteins that are conserved from yeast to man and function in most intracellular membrane 

3 o fusion reactions. Seel represents a family of yeast proteins that function at many different stages in the 

secretory pathway including membrane fusion. Recently, mammalian homologs of Seel , called 
Munc-18 proteins, have been identified (Katagiri, H. et al. (1995) J. Biol. Chem. 270:4963-4966; Hata 
etal. supra) . 

The SNARE complex involves three SNARE molecules, one in the vesicular membrane and 
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two in the target membrane. Synaptotagmin is an integral membrane protein in the synaptic vesicle 
which.associates with the t-SNARE syntaxin in the docking complex. Synaptotagmin binds calcium in 
a complex with negatively charged phospholipids, which allows the cytosolic SNAP protein to displace 
synaptotagmin from syntaxin and fusion to occur. Thus, synaptotagmin is a negative regulator of 
5 fusion in the neuron (Littleton, J.T. et al. (1993) Cell 74:1 125-1 134). The most abundant membrane 
protein of synaptic vesicles appears to be the glycoprotein synaptqphysin, a 38 kDa protein with four 
transmembrane domains. 

Specificity between a vesicle and its target is derived from the v-SNARE, t-SNAREs, and 
associated proteins involved Different isoforms of SNAREs and Rabs show distinct cellular and 
10 subcellular distributions. VAMP-l/synaptobrevin, membrane-anchored synaptosome-associated 

protein of 25 kDa (SNAP-25), syntaxin-1, Rab3A, Rabl5, and Rab23 are predominantly expressed in 
the brain and nervous system. Different syntaxin, VAMP, and Rab proteins are associated with 
distinct subcellular compartments and their vesicular carriers. 
Nuclear Transport 

1 5 Transport of proteins and RN A between the nucleus and Hie cytoplasm occurs through nuclear 

pore complexes (NPCs). NPC -mediated transport occurs in both directions through the nuclear 
envelope. All nuclear proteins are imported from the cytoplasm, their site of synthesis: tRNAand 
mRNA are exported from the nucleus, their site of synthesis, to the cytoplasm, their site of function. 
Processing of small nuclear RNAs involves export into the cytoplasm, assembly with proteins and 

2 o modifications such as hypermethylation to produce small nuclear ribonuclear proteins (snRNPs), and 
subsequent import of the snRNPs back into the nucleus. The assembly of ribosomes requires the initial 
import of ribosomal proteins from the cytoplasm, their incorporation with RNA into ribosomal 
subnnits, and export back to the cytoplasm. (Gtirlich, D. andl.W. Mattaj (1996) Science 271:1513- 
1518.) 

2 5 The transport of proteins and mRNAs across the NPC is selective, dependent on nuclear 

localization signals, and generally requires association with nuclear transport factors. Nuclear 
localization signals (NLS) consist of short stretches of amino acids enriched in basic residues. NLS are 
found on proteins that are targeted to the nucleus, such as the glucocorticoid receptor. The NLS is 
recognized by the NLS receptor, impartin, which then interacts with the monomelic GTP-binding 

3 o protein Ran. This NLS protein/receptor/Ran complex navigates the nuclear pore with the help of the 

homodimeric protein nuclear transport factor 2 (NTF2). NTF2 binds the GDP-bound form of Ran and 
to multiple proteins of the nuclear pore complex containing FXFG repeat motifs, such as p62. 
(Paschal, B. et al. (1997) J. Biol. Chem. 272:21534-21539; and Wong, D.H. et al. (1997) Mol. Cell 
Biol. 17:3755-3767). Some proteins are dissociated before nuclear mRNAs are transported across the 
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NPC while others are dissociated shortly after unclear mRNA transport across the NPC and are 
reimported into the nucleus. 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
5 transport or secretion of proteins. For example, abnormal hormonal secretion is linked to disorders 
such as diabetes insipidus (vasopressin), hypo*- and hypoglycemia (insulin, glucagon), Grave's disease 
and goiter (thyroid hormone), and dishing' s and Addison's diseases (adrenocorticotropic hormone, 
ACTH). Moreover, cancer cells secrete excessive amounts of hormones or other biologically active 
peptides. Disorders related to excessive secretion of biologically active peptides by tumor cells include 

l o fasting hypoglycemia due to increased insulin secretion from insulinoma-islet cell tumors; hypertension 
due to increased epinephrine and norepinephrine secreted from pheochromocytomas of the adrenal 
medulla and sympathetic paraganglia; and carcinoid syndrome, which is characterized by abdominal 
cramps, diarrhea, and valvular heart disease caused by excessive amounts of vasoactive substances 
such as serotonin, bradykinin, histamine, prostaglandins, and polypeptide hormones, secreted from 

15 intestinal tumors. Biologically active peptides that are ectopically synthesized in and secreted from 
tumor cells include ACTH and vasopressin (lung and pancreatic cancers); parathyroid hormone (lung 
and bladder cancers); calcitonin (lung and breast cancers); and thyroid-stimulating hormone (medullary 
thyroid carcinoma). Such peptides may be useful as diagnostic markers for tumorigenesis (Schwartz, 
M.Z. (1997) Semin. Pediatr. Surg. 3:141-146; and Said, S.I. and G.R. Faloona (1975) N. Engl. J. Med. 

20 293:155-160). 

Defective nuclear transport may play a role in cancer. The BRCA1 protein contains three 
potential NLSs which interact with importin alpha, and is transported into the nucleus by the 
importin/NPC pathway. In breast cancer cells the BRCA1 protein is aberrantly localized in the 
cytoplasm. The mislocation of the BRCA1 protean in breast cancer cells may be due to a defect in the 
25 NPC nuclear import pathway (Chen, C.F. et al. (1996) J. Biol. Chem. 271:32863-32868). 

It has been suggested that in some breast cancers, the tumor-suppressing activity of p53 is 
inactivated by the sequestration of the protein in the cytoplasm, away from its site of action in the cell 
nucleus. Cytoplasmic wild-type p53 was also found in human cervical carcinoma cell lines. (Moll, 
U.M. et al. (1992) Proc. Natl. Acad. Sci. USA 89:7262-7266; and Liang, X.H. et al. (1993) Oncogene 
30 8:2645-2652.) 

Environmental Responses 

Organisms respond to the environment by a number of pathways. Heat shock proteins, 
including hsp 70, hsp60, hsp90, and hsp 40, assist organisms in coping with heat damage to cellular 
proteins. 
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Aquaporins (AQP) are channels that transport water and, in some cases, nonionic small solutes 
such as urea and glycol. Water movement is important for a number of physiological processes 
including renal fluid filtration, aqueous humor generation in the eye, cerebrospinal fluid production in 
the brain, and appropriate hydration of the lung. Aquaporins are members of the major intrinsic protein 
5 (MIP) family of membrane transporters (King, L.S. and P. Agre (1996) Annu. Rev. Physiol. 58:619- 
648; Ishibashi, K. etal. (1997) J. Biol Cheia 272:20782-20786). The study of aquaporins may have 
relevance to understanding edema formation and fluid balance in both normal physiology and disease 
states (King, supra) . Mutations in AQP2 cause autosomal recessive nephrogenic diabetes insipidus 
(03VHM *107777 Aquaporin 2; AQP2). Reduced AQP4 expression in skeletal muscle may be 
10 associated with Duchenne muscular dystrophy (Frigeri, A. et al. (1998) J. Clin. Invest. 102:695-703). 
Mutations in AQP0 cause autosomal dominant cataracts in the mouse (OMIM *154050 Major Intrinsic 
Protein of Lens Fiber; MIP). 

The metallothioneins (MTs) are a group of small (61 amino acids), cysteine-rich proteins that 
bind heavy metals such as cadmium, zinc, mercury, lead, and copper and are thought to play a role in 
15 metal detoxification or the metabolism and homeostasis of metals. Arsenite-resistance proteins have 
been identified in hamsters that are resistant to toxic levels of arsenite (Rossman, T.G. et al. (1997) 
Mutat. Res. 386:307-314). 

Humans respond to light and odors by specific protein pathways. Proteins involved in light 
perception include rhodopsin, transducin, and cGMP phosphodiesterase. Proteins involved in odor 
2 o perception include multiple olfactory receptors. Other proteins are important inhuman Orcadian 
rhythms and responses to wounds. 
Immunity and Host Defense 

All vertebrates have developed sophisticated and complex immune systems that provide 
protection from viral, bacterial, fungal and parasitic infections. Included in these systems are the 

2 5 processes of humoral immunity, the complement cascade and the inflammatory response (Paul, W.E. 

(1993) Fundamental Immunology . Raven Press, Ltd., New York NY, pp.1-20). 

The cellular components of the humoral immune system include six different types of 
leukocytes: monocytes, lymphocytes, polymorphonuclear granulocytes (consisting of neutrophils, 
eosinophils, and basophils) and plasma cells. Additionally, fragments of megakaryocytes, a seventh 

3 o type of white blood cell in the bone marrow, occur in large numbers in the blood as platelets. 

Leukocytes are formed from two stem cell lineages in bone marrow. The myeloid stem cell 
line produces granulocytes and monocytes and, the lymphoid stem cell produces lymphocytes. 
Lymphoid cells travel to the thymus, spleen and lymph nodes, where they mature and differentiate 
into lymphocytes. Leukocytes are responsible for defending the body against invading pathogens. 
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Neutrophils and monocytes attack invading bacteria, viruses, and other pathogens and destroy them 
by phagocytosis. Monocytes enter tissues and differentiate into macrophages which are extremely 
phagocytic. Lymphocytes and plasma cells are a part of the immune system which recognizes 
specific foreign molecules and organisms and inactivates them, as well as signals other cells to attack 
5 the invaders. 

Granulocytes and monocytes are formed and stored in the bone marrow until needed. 
Megakaryocytes are produced in bone marrow, where they fragment into platelets and are released 
into the bloodstream. The main function of platelets is to activate the blood clotting mechanism. 
Lymphocytes and plasma cells are produced in various lymphogenous organs, including the lymph 
10 nodes, spleen, thymus, and tonsils. 

Both neutrophils and macrophages exhibit chemotaxis towards sites of inflammation. Tissue 
inflammation in response to pathogen invasion results in production of chemo-attractants for 
leukocytes, such as endotoxins or other bacterial products, prostaglandins, and products of leukocytes 
or platelets. 

15 Basophils participate in the release of the chemicals involved in the inflammatory process. 

The main function of basophils is secretion of these chemicals to such a degree that they have been 
referred to as "unicellular endocrine glands." A distinct aspect of basophilic secretion is that the 
contents of granules go directly into the extracellular environment, not into vacuoles as occurs with 
neutrophils, eosinophils and monocytes. Basophils have receptors for the Fc fragment of 

2 o immunoglobulin E (IgE) that are not present on other leukocytes. Crosslinking of membrane IgE 
with anti-IgE or other ligands triggers degranulation. 

Eosinophils are bi- or multi-nucleated white blood cells which contain eosinophilic granules. 
Their plasma membrane is characterized by Ig receptors, particularly IgG and IgE. Generally, 
eosinophils are stored in the bone marrow until recruited for use at a site of inflammation or invasion. 

2 5 They have specific functions in parasitic infections and allergic reactions, and are thought to detoxify 

some of the substances released by mast cells and basophils which cause inflammation. Additionally, 
they phagocytize antigen-antibody complexes and further help prevent spread of the inflammation. 

Macrophages are monocytes that have left the blood stream to settle in tissue. Once 
monocytes have migrated into tissues, they do not re-enter the bloodstream. The mononuclear 

3 o phagocyte system is comprised of precursor cells in the bone marrow, monocytes in circulation, and 

macrophages in tissues. The system is capable of very fast and extensive phagocytosis. A 
macrophage may phagocytize over 100 bacteria, digest them and extrude residues, and then survive 
for many more months. Macrophages are also capable of ingesting large particles, including red 
blood cells and malarial parasites. They increase several-fold in size and transform into macrophages 
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that are characteristic of the tissue they have entered, surviving in tissues for several months. 

Mononuclear phagocytes are essential in defending the body against invasion by foreign 
pathogens, particularly intracellular microorganisms such as M. tuberculosis , listeria, leishmania and 
toxoplasma. Macrophages can also control the growth of tumorous cells, via both phagocytosis and 
secretion of hydrolytic enzymes. Another important function of macrophages is that of processing 
antigen and presenting them in a biochemically modified form to lymphocytes. 

The immune system responds to invading microorganisms in two major ways: antibody 
production and cell mediated responses. Antibodies are immunoglobulin proteins produced by 
B-lymphocytes which bind to specific antigens and cause inactivation or promote destruction of the 
antigen by other cells. Cell-mediated immune responses involve T-lymphocytes (T cells) that react 
with foreign antigen on the surface of infected host cells. Depending on the type of T cell, the 
infected cell is either killed or signals are secreted which activate macrophages and other ceils to 
destroy the infected cell (Paul, supra) . 

T-lymphocytes originate in the bone marrow or liver in fetuses. Precursor cells migrate via 
the blood to the thymus, where they are processed to mature into T-lymphocytes. This processing is 
crucial because of positive and negative selection of T cells that will react with foreign antigen and 
not with self molecules. After processing, T cells continuously circulate in the blood and secondary 
lymphoid tissues, such as lymph nodes, spleen, certain epithelium-associated tissues in the 
gastrointestinal tract, respiratory tract and skin. When T-lymphocytes are presented with the 
complementary antigen, they are stimulated to proliferate and release large numbers of activated T 
cells into the lymph system and the blood system. These activated T cells can survive and circulate 
for several days. At the same time, T memory cells are created, which remain in the lymphoid tissue 
for months or years. Upon subsequent exposure to that specific antigen, these memory cells will 
respond more rapidly and with a stronger response than induced by the original antigen. This creates 
an "immunological memory" that can provide immunity for years. 

There are two major types of T cells: cytotoxic T cells destroy infected host cells, and helper 
T cells activate other white blood cells via chemical signals. One class of helper cell, T H 1, activates 
macrophages to destroy ingested microorganisms, while another, T H 2, stimulates the production of 
antibodies by B cells. 

Cytotoxic T cells directly attack the infected target cell. In virus-infected cells, peptides 
derived from viral proteins are generated by the proteasome. These peptides are transported into the 
ER by the transporter associated with antigen processing (TAP) (Pamer, E. and P. Cresswell (1998) 
Annu. Rev. Immunol. 16:323-358). Once inside the ER, the peptides bind MHC I chains, and the 
peptide/MHC I complex is transported to the cell surface. Receptors on the surface of T cells bind to 
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antigen presented on cell surface MHC molecules. Once activated by binding to antigen, T cells 
secrete y-wterf eron, a signal molecule that induces the expression of genes necessary for presenting 
viral (or other) antigens to cytotoxic T cells. Cytotoxic T cells kill the infected cell by stimulating 
programmed cell death. 

5 Helper T cells constitute up to 75 % of the total T cell population. They regulate the immune 

functions by producing a variety of lymphokines that act on other cells in the immune system and on 
bone marrow. Among these lymphokines are: interleukins-2,3,4,5,6; granulocyte-monocyte colony 
stimulating factor, and y-inteferon. 

Helper T cells are required for most B cells to respond to antigen. When an activated helper 

10 cell contacts a B cell, its centrosome and Golgi apparatus become oriented toward the B cell, aiding 
the directing of signal molecules, such as transmembrane-bound protein called CD40 ligand, onto the 
B cell surface to interact with the CD40 transmembrane protein. Secreted signals also help B cells to 
proliferate and mature and, in some cases, to switch the class of antibody being produced. 

B-lymphocytes (B cells) produce antibodies which react with specific antigenic proteins 

1 5 presented by pathogens. Once activated, B cells become filled with extensive rough endoplasmic 
reticulum and are known as plasma cells. As with T cells, interaction of B cells with antigen 
stimulates proliferation of only those B cells which produce . antibody specific to that antigen. There 
are five classes of antibodies, known as immunoglobulins, which together comprise about 20% of 
total plasma protein. Each class mediates a characteristic biological response after antigen binding. 

2 a Upon activation by specific antigen B cells switch from making membrane-bound antibody to 

secretion of that antibody. 

Antibodies, or immunoglobulins (Ig), are the founding members of the Ig superfamily and the 
central components of the humoral immune response. Antibodies are either expressed on the surface 
of B cells or secreted by B cells into the circulation. Antibodies bind and neutralize blood-borne 
25 foreign antigens. The prototypical antibody is a tetramer consisting of two identical heavy 

polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 
disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 
Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 6, e, y, and \i H-chain types. There are two types of L- 

3 o chains, k and A, either of which may associate as a pair with any H-chain pair. IgG, the most 

common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 
are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C -terminal constant 
region. Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
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contains four Ig domains, three of which occur within the constant region and one of which occurs 
within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 
which occurs within the variable region. In addition, H chains such as \i have been shown to 
5 associate with other polypeptides during differentiation of the B cell. 

Antibodies can be described in terms of their two main functional domains. Antigen 
recognition is mediated by the Fab (antigen binding fragment) region of the antibody, while effector 
functions are mediated by the Fc (crystallizable fragment) region. Binding of antibody to an antigen, 
such as a bacterium, triggers the destruction of the antigen by phagocytic white blood cells such as 

10 macrophages and neutrophils. These cells express surface receptors that specifically bind to the 
antibody Fc region and allow the phagocytic cells to engulf, ingest, and degrade the antibody-bound 
antigen. The Fc receptors expressed by phagocytic cells are single-pass transmembrane glycoproteins 
of about 300 to 400 amino acids (Sears, D.W. et al. (1990) J. Immunol. 144:371-378). The 
extracellular portion of the Fc receptor typically contains two or three Ig domains. 

1 5 Diseases which cause over- or under-abundance of any one type of leukocyte usually result in 

the entire immune defense system becoming involved. A well-known autoimmune disease is AIDS 
(Acquired Immunodeficiency Syndrome) where the number of helper T cells is depleted, leaving the 
patient susceptible to infection by microorganisms and parasites. Another widespread medical 
condition attributable to the immune system is that of allergic reactions to certain antigens. Allergic 

2 o reactions include: hay fever, asthma, anaphylaxis, and urticaria (hives). Leukemias are an excess 

production of white blood cells, to the point where a major portion of the body's metabolic resources 
are directed solely at proliferation of white blood cells, leaving other tissues to starve. Leukopenia or 
agranulocytosis occurs when the bone marrow stops producing white blood cells. This leaves the • 
body unprotected against foreign microorganisms, including those which normally inhabit skin, 
25 mucous membranes, and gastrointestinal tract. If all white blood cell production stops completely, 
infection will occur within two days and death may follow only 1 to 4 days latex. 

Impaired phagocytosis occurs in several diseases, including monocytic leukemia, systemic 
lupus, and granulomatous disease. In such a situation, macrophages can phagocytize normally, but 
the enveloped organism is not killed. A defect in the plasma membrane enzyme which converts 

3 o oxygen to lethally reactive forms results in abscess formation in liver, lungs, spleen, lymph nodes, 

and beneath the skin. Eosinophilia is an excess of eosinophils commonly observed in patients with 
allergies (hay fever, asthma), allergic reactions to drugs, rheumatoid arthritis, and cancers (Hodgkin's 
disease, lung, and liver cancer) (Isselbacher, KJ. et al. (1994) Harrison's Principles of Internal 
Medicine . McGraw-Hill, Inc., New York NY). 
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Host defense is further augmented by the complement system. The complement system 
serves as an effector system and is involved in infectious agent recognition. It can function as an 
independent immune network or in conjunction with other humoral immune responses. The 
complement system is comprised of numerous plasma and membrane proteins that act in a cascade of 
reaction sequences whereby one component activates the next. The result is a rapid and amplified 
response to infection through either an inflammatory response or increased phagocytosis. 

The complement system has more than 30 protein components which can be divided into 
functional groupings including modified serine proteases, membrane-binding proteins and regulators 
of complement activation. Activation occurs through two different pathways the classical and the 
alternative. Both pathways serve to destroy infectious agents through distinct triggering mechanisms 
that eventually merge with the involvement of the component C3. 

The classical pathway requires antibody binding to infectious agent antigens. The antibodies 
serve to define the target and initiate the complement system cascade, culminating in the destruction 
of the infectious agent In this pathway, since the antibody guides initiation of the process, the 
complement can be seen as an effector arm of the humoral immune system. 

The alternative pathway of the complement system does not require the presence of pre- 
existing antibodies for targeting infectious agent destruction. Rather, this pathway, through low 
levels of an activated component, remains constantly primed and provides surveillance in the non- 
immune host to enable targeting and destruction of infectious agents. In this case foreign material 
triggers the cascade, thereby facilitating phagocytosis or lysis (Paul, supra , pp.918-919). ' 

Another important component of host defense is the process of inflammation. Inflammatory 
responses are divided into four categories on the basis of pathology and include allergic 
inflammation, cytotoxic antibody mediated inflammation, immune complex mediated inflammation 
and monocyte mediated inflammation. Inflammation manifests as a combination of each of these 
forms with one predominating. 

Allergic acute inflammation is observed in individuals wherein specific antigens stimulate 
IgE antibody production. Mast cells and basophils are subsequently activated by the attachment of 
antigen-IgE complexes, resulting in the release of cytoplasmic granule contents such as histamine. 
The products of activated mast cells can increase vascular permeability and constrict the smooth 
muscle of breathing passages, resulting in anaphylaxis or asthma. Acute inflammation is also 
mediated by cytotoxic antibodies and can result in the destruction of tissue through the binding of 
complement-fixing antibodies to cells. The responsible antibodies are of the IgG or IgM types. 
Resultant clinical disorders include autoimmune hemolytic anemia and thrombocytopenia as 
associated with systemic lupus erythematosis. 
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Immune complex mediated acute inflammation involves the IgG or IgM antibody types 
which combine with antigen to activate the complement cascade. When such immune complexes 
bind to neutrophils and macrophages they activate the respiratory burst to form protein- and vessel- 
damaging agents such as hydrogen peroxide, hydroxyl radical, hypochlorous acid, and chloramines. 
Clinical manifestations include rheumatoid arthritis and systemic lupus erythematosus. 

In chronic inflammation or delayed-type hypersensitivity, macrophages are activated and 
process antigen for presentation to T cells that subsequently produce lymphokines and monokines. 
This type of inflammatory response is likely important for defense against intracellular parasites and 
certain viruses. Clinical associations include, granulomatous disease, tuberculosis, leprosy, and 
sarcoidosis (Paul, W.E., supra , pp.1017- 1018). 

Extracellular Information Transmission Molecules 

Intercellular communication is essential for the growth and survival of multicellular 
organisms, and in particular, for the function of the endocrine, nervous, and immune systems. In 
addition, intercellular communication is critical for developmental processes such as tissue 
construction and organogenesis, in which cell proliferation, cell differentiation, and morphogenesis 
must be spatially and temporally regulated in a precise and coordinated manner. Cells communicate 
with one another through the secretion and uptake of diverse types of signaling molecules such as 
hormones, growth factors, neuropeptides, and cytokines. 
Hormones 

Hormones are signaling molecules that coordinately regulate basic physiological processes 
from embryogenesis throughout adulthood. These processes include metabolism, respiration, 
reproduction, excretion, fetal tissue differentiation and organogenesis, growth and development, 
homeostasis, and the stress response. Hormonal secretions and the nervous system are tightly 
integrated and interdependent. Hormones are secreted by endocrine glands, primarily the hypothalamus 
and pituitary, the thyroid and parathyroid, the pancreas, the adrenal glands, and the ovaries and testes. 

The secretion of hormones into the circulation is tightly controlled Hormones are often 
secreted in diurnal, pulsatile, and cyclic patterns. Hormone secretion is regulated by perturbations in 
blood biochemistry, by other upstream-acting hormones, by neural impulses, and by negative feedback 
loops. Blood hormone concentrations are constantly monitored and adjusted to maintain optimal, 
steady-state levels. Once secreted, hormones act only on those target cells that express specific 
receptors. 

Most disorders of the endocrine system are caused by either hyposecretion or hypersecretion of 
hormones. Hyposecretion often occurs when a hormone's gland of origin is damaged or otherwise 
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impaired Hypersecretion often results from the proliferation of tumors derived from hormone-secreting 
cells. Inappropriate hormone levels may also be caused by defects in regulatory feedback loops or in 
the processing of hormone precursors. Endocrine malfunction may also occur when the target cell fails 
to respond to the hormone. 
5 Hormones can be classified biochemically as polypeptides, steroids, eicosanoids, or amines. 

Polypeptides, which include diverse hormones such as insulin and growth hormone, vary in size and 
function and are often synthesized as inactive precursors that are processed intr acellularly into mature, 
active forms. Amines, which include epinephrine and dopamine, are amino acid derivatives that 
function in neuroendocrine signaling. Steroids, which include the cholesterol-derived hormones 

1 o estrogen and testosterone, function in sexual development and reproduction. Eicosanoids, which 

include prostaglandins and prostacyclins, are fatty acid derivatives that function in a variety of 
processes. Most polypeptides and some amines are soluble in the circulation where they are highly 
susceptible to proteolytic degradation within seconds after their secretion. Steroids and lipids are 
insoluble and must be transported in the circulation by carrier proteins. The following discussion will 
15 focus primarily on polypeptide hormones. 

Hormones secreted by the hypothalamus and pituitary gland play a critical role in endocrine 
function by coordinated regulating hormonal secretions from other endocrine glands in response to 
neural signals. Hypothalamic hormones include thyrotropin-rdeasing hormone, gonadotropin-reieasing 
hormone, somatostatin, growth-hormone releasing factor, corticotropin-releasing hormone, substance P, 

2 o dopamine, and prolactin-releasing hormone. These hormones directly regulate the secretion of 

hormones from the anterior lobe of the pituitary. Hormones secreted by the anterior pituitary include 
adrenocorticotropic hormone (ACTH), mdanocyte-stimulating hormone, somatotropic hormones such 
as growth hormone and prolactin, glycoprotein hormones such as thyroid-stimulating hormone, 
luteinizing hormone (LH), and folUcle-stimulating hormone (FSH), P-lipotropin, and P -endorphins. 

2 5 These hormones regulate hormonal secretions from the thyroid, pancreas, and adrenal glands, and act 

directly on the reproductive organs to stimulate ovulation and spermatogenesis. The posterior pituitary 
synthesizes and secretes antidiuretic hormone (ADH, vasopressin) and oxytocin. 

Disorders of the hypothalamus and pituitary often result from lesions such as primary brain 
tumors, adenomas, infarction associated with pregnancy, hypophysectomy, aneurysms, vascular 

3 o malformations, thrombosis, infections, immunological disorders, and complications due to head trauma. 

Such disorders have profound effects on the function of other endocrine glands. Disorders associated 
with hypopituitarism include hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 
Hand-Schufler-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 
dwarfism. Disorders associated with hyperpituitarism include acromegaly, giantism, and syndrome of 
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inappropriate ADH secretion (SIADH), often caused by benign adenomas. 

Hormones secreted by the thyroid and parathyroid primarily control metabolic rates and the 
regulation of serum calcium levels, respectively. Thyroid hormones include calcitonin, somatostatin, 
and thyroid hormone. The parathyroid secretes parathyroid hormone. Disorders associated with 
hypothyroidism include goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and 
actinism. Disorders associated with hyperthyroidism include thyrotoxicosis and its various forms, 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's 
disease Disorders associated with hyperparathyroidism include Conn disease (chronic hypercalemia) 
leading to bone resorption and parathyroid hyperplasia. 

Hormones secreted by the pancreas regulate blood glucose levels by modulating the rates of 
carbohydrate, fat, and protein metabolism. Pancreatic iioriiioiies include insulin, glucagon, amyiin, y- 
aminobutyric add, , gastrin, somatostatin, and pancreatic polypeptide. The principal disorder associated 
with pancreatic dysfunction is diabetes mellitus caused by insufficient insulin activity. Diabetes 
mellitus is generally classified as either Type I (insulin-dependent, juvenile diabetes) or Type n (non- 
insulin-dependent, adult diabetes). The treatment of both forms by insulin replacement therapy is well 
known. Diabetes mellitus often leads to acute complications such as hypoglycemia (insulin shock), 
coma, diabetic ketoacidosis, lactic acidosis, and chronic complications leading to disorders of the eye, 
kidney, skin, bone, joint, cardiovascular system, nervous system, and to decreased resistance to 
infection. 

The anatomy, physiology, and diseases related to hormonal function are reviewed in McCance, 
KL. and S.E. Huether (1994) Pathophysiology: The Biological Basis for Disease in Adults and 
Children, Mosby-Year Book, Inc., St. Louis MO; Greenspan, RS. and J.D. Baxter (1994) Basic and 
Clinical Endocrinology . Appleton and Lange, East Norwalk CT. 
Growth Factors 

Growth factors are secreted proteins that mediate intercellular communication. Unlike 
hormones, which travel great distances via the circulatory system, most growth factors are primarily 
local mediators that act on neighboring cells. Most growth factors contain a hydrophobic N-terminal 
signal peptide sequence which directs the growth factor into the secretory pathway. Most growth 
factors also undergo post-translational modifications within the secretory pathway. These 
modifications can include proteolysis, glycosylation, phosphorylation, and intramolecular disulfide bond 
formation. Once secreted, growth factors bind to specific receptors on the surfaces of neighboring 
target cells, and the bound receptors trigger intracellular signal transduction pathways. These signal 
transduction pathways elicit specific cellular responses in the target cells. These responses can include 
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the modulation of gene expression and the stimulation or inhibition of cell division, ceil differentiation, 
and cell motility. 

Growth factors fall into at least two broad and overlapping classes. The broadest class 
includes the large polypeptide growth factors, which are wide-ranging in their effects. These factors 
include epidermal growth factor (EGF), fibroblast growth factor (FGF), transforming growth factor-P 
(TGF-(J), insulin-like growth factor (IGF), nerve growth factor (NGF), and platelet-derived growth 
factor (PDGF), each defining a family of numerous related factors. The large polypeptide growth 
factors, with the exception of NGF, act as mitogens on diverse cell types to stimulate wound healing, 
bone synthesis and remodeling, extracellular matrix synthesis, and proliferation of epithelial, 
epidermal, and connective tissues. Members of the TGF-(J, EGF, and FGF families also function as 
inductive signals in the differentiation of embryonic tissue. NGF functions specifically as a 
neurotrophic factor, promoting neuronal growth and differentiation. 

Another class of growth factors includes the hematopoietic growth factors, which are narrow in 
their target specificity. These factors stimulate the proliferation and differentiation of blood cells such 
as B-lymphocytes, T-lymphocytes, erythrocytes, platelets, eosinophils, basophils, neutrophils, 
macrophages, and their stem cell precursors. These factors include the colony-stimulating factors (G- 
CSF, M-CSF, GM-CSF, and CSF1-3), erythropoietin, and the cytokines. The cytokines are specialized 
hematopoietic factors secreted by cells of the immune system and are discussed in detail below. 

Growth factors play critical roles in neoplastic transformation of cells in vitro and in tumor 
progression in vivo . Overexpression of the large polypeptide growth factors promotes the 
proliferation and transformation of cells in culture. Inappropriate expression of these growth factors 
by tumor cells in vivo may contribute to tumor vascularization and metastasis. Inappropriate activity 
of hematopoietic growth factors can result in anemias, leukemias, and lymphomas. Moreover, growth 
factors are both structurally and functionally related to oncoproteins, the potentially cancer-causing 
products of proto-oncogenes. Certain FGF and PDGF family members are themselves homologous to 
oncoproteins, whereas receptors for some members of the EGF, NGF, and FGF families are encoded 
by proto-oncogenes. Growth factors also affect the transcriptional regulation of both proto-oncogenes 
and oncosuppressor genes (Pimentel, E. (1994) Handbook of Growth Factors . CRC Press, Ann Arbor 
MI; McKay, I. and I. Leigh, eds. (1993) Growth Factors: A Practical Approach , Oxford University 
Press, New York NY; Habenicht, A., ed. (1990) Growth Factors. Differentiation Factors, and 
Cytokines . Springer-Verlag, New York NY). 

In addition, some of the large polypeptide growth factors play crucial roles in the induction of 
the primordial germ layers in the developing embryo. This induction ultimately results in the formation 
of the embryonic mesoderm, ectoderm, and endoderm which in turn provide the framework for the 
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entire adult body plan. Disruption of this inductive process would be catastrophic to embryonic 
development. 

Small Peptide Factors - Neuropeptides and Vasomediators 

Neuropeptides and vasomediators (NP/VM) comprise a family of small peptide factors, 
typically of 20 amino acids or less. These factors generally function in neuronal excitation and 
inhibition of vasoconstriction/vasodilation, muscle contraction, and hormonal secretions from the 
brain and other endocrine tissues. Included in this family are neuropeptides and neuropeptide 
hormones such as bombesin, neuropeptide Y, neurotensin, neuromedin N, melanocortins, opioids, 
galanin, somatostatin, tachykinins, urotensin n and related peptides involved in smooth muscle 
stimulation, vasopressin, vasoactive intestinal peptide, and circulatory system-borne signaling 
molecules such as angiotensin, complement, calcitonin, endothelins, formyl-metbionyl peptides, 
glucagon, choiecystokinin, gastrin, and many of the peptide hormones discussed above. NP/VMs can 
transduce signals directly, modulate the activity or release of other neurotransmitters and hormones, and 
act as catalytic enzymes in signaling cascades. The effects of NP/VMs range from extremely brief to 
long-lasting. (Reviewed in Martin, C.R. et al. (1985) Endocrine Physiology, Oxford University Press, 
New York NY, pp. 57-62.) 
Cytokines 

Cytokines comprise a family of signaling molecules that modulate the immune system and the - 
inflammatory response. Cytokines are usually secreted by leukocytes, or white blood cells, in response 
to injury or infection. Cytokines function as growth and differentiation factors that act primarily on 
cells of the immune system such as B- and T-lymphocytes, monocytes, macrophages, and granulocytes. 
Like other signaling molecules, cytokines bind to specific plasma membrane receptors and trigger 
intracellular signal transduction pathways which alto gene expression patterns. There is considerable 
potential for the use of cytokines in the treatment of inflammation and immune system disorders. 

Cytokine structure and function have been extensively characterized in vitro . Most cytokines 
are small polypeptides of about 30 kilodaltons or less. Ova: 50 cytokines have been identified from 
human and rodent sources. Examples of cytokine subfamilies include the interferons (IFN-a, -P, and - 
Y), the interleukins (IL1-IL13), the tumor necrosis factors (TNF-a and -P), and the chemokines. Many 
cytokines have been produced using recombinant DNA techniques, and the activities of individual 
cytokines have been determined in vitro . These activities include regulation of leukocyte proliferation, 
differentiation, and motility. 

The activity of an individual cytokine in vitro may not reflect the full scope of that cytokine's 
activity in vivo . Cytokines are not expressed individually in vivo but are instead expressed in 
combination with a multitude of other cytokines when the organism is challenged with a stimulus. 
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Together, these cytokines collectively modulate the immune response in a manna: appropriate for that 
particular stimulus. Therefore, the physiological activity of a cytokine is determined by the stimulus 
itself and by complex interactive networks among co-expressed cytokines which may demonstrate both 
synergistic and antagonistic relationships. 
5 Chemokines comprise a cytokine subfamily with over 30 members. (Reviewed in Wells, T. 

N.C. and M.C. Peitsch (1997) J. Leukoc. Biol. 61:545-550.) Chemokines were initially identified as 
chemotactic proteins that recruit monocytes and macrophages to sites of inflammation. Recent evidence 
indicates that chemokines may also play key roles in hematppoiesis and HIV-1 infectioa Chemokines 
are small proteins which range from about 6-15 kflodaltons in molecular weight. Chemokines are 

10 further classified as C, CC, CXC, or CX 3 C based on the number and position of critical cysteine 
residues. The CC chemokines, for example, each contain a conserved motif consisting of two 
consecutive cysteines followed by two additional cysteines which occur downstream at 24- and 16- 
residue intervals, respectively (ExPASy PROSITE database, documents PS00472 and PDOC00434). 
The presence and spacing of these four cysteine residues are highly conserved, whereas the intervening 

15 residues diverge significantly. However, a conserved tyrosine located about 15 residues downstream of 
the cysteine doublet seems to be important for chemotactic activity. Most of the human genes encoding 
CC chemokines are clustered on chromosome 17, although tee are a few examples of CC chemokine 
genes that map elsewhere. Other chemokines include lymphotactin (C chemokine); macrophage 
chemotactic and activating factor (MCAF/MCP-1 ; CC chemokine); platelet factor 4 and EL-8 (CXC 

20 chemokines); and fi;actalkine and neurotractin (CX3C chemokines). (Reviewed in Luster, A.D. (1998) 
N. Engl. J. Med. 338:436-445.) 

Receptor Molecules 

The term receptor describes proteins that specifically recognize other molecules. The category 
25 is broad and includes proteins with a variety of functions. The bulk of receptors are cell surface 
proteins which bind extracellular ligands and produce cellular responses in the areas of growth, 
differentiation, endocytosis, and immune response Other receptors facilitate the selective transport of 
proteins out of the endoplasmic reticulum and localize enzymes to particular locations in the cell. The 
term may also be applied to proteins which act as receptors for ligands with known or unknown 
3 o chemical composition and which interact with other cellular components. For example, the steroid 
hormone receptors bind to and regulate transcription of DNA. 

Regulation of ceil proliferation, differentiation, and migration is important for the formation 
and junction of tissues. Regulatory proteins such as growth factors coordinate^ control these cellular 
processes and act as mediators in cell-cell signaling pathways. Growth factors are secreted proteins 
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that bind to specific cefl-surface receptors on target cells. The bound receptors trigger intracellular 
signal transduction pathways which activate various downstream effectors that regulate gene 
expression, cell division, cell differentiation, cell motility, and other cellular processes. 

Cell surface receptors are typically integral plasma membrane proteins. These receptors 
recognize hormones such as catecholamines; peptide hormones; growth and differentiation factors; 
small peptide factors such as thyrotropin-releasing hormone; galanin, somatostatin, and tachykinins; 
and circulatory system-borne signaling molecules. Cell surface receptors on immune system cells 
recognize antigens, antibodies, and major histocompatibility complex (MHC)-bound peptides. Other 
cell surface receptors bind ligands to be internalized by the cell. This receptor-mediated endocytosis 
functions in the uptake of low density lipoproteins (LDL), transferrin, glucose- or mannose-terminal 
glycoproteins, galactose-terminal glycoproteins, immnnoglobnlins, phosphoviteHogenins, fibrin, 
proteinase-ipJnbitor complexes, plasminogen activators, and throiiujospunuin (Lodish, K. et ai. (1595) 
Molecular Cell Biology . Scientific American Books, New York NY, p. 723; Mikhailenko, I. et al. 
(1997) J. Biol. Chem. 272:6784-6791). 
Receptor Protein Kinases 

Many growth factor receptors, including receptors for epidermal growth factor, 
platelet-derived growth factor, fibroblast growth factor, as well as the growth modulator a-thrombin, 
contain intrinsic protean kinase activities. When growth factor binds to the receptor, it triggers the 
autophosphorylation of a serine, threonine, or tyrosine residue on the receptor. These phospharylated 
sites are recognition sites for the binding of other cytoplasmic signaling proteins. These proteins 
participate in signaling pathways that eventually link the initial receptor activation at the cell surface to 
the activation of a specific intracellular target molecule. In the case of tyrosine residue 
autophosphorylation, these signaling proteins contain a common domain referred to as a Src homology 
(SH) domain. SH2 domains and SH3 domains are found in phospholipase O7, PI-3-K p85 regulatory 
subunit, Ras-GTPase activating protein, and ppeo 0 ** (Lowenstein, E.J. et al. (1992) Cell 70:431-442). 
The cytokine family of receptors share a different common binding domain and include transmembrane 
receptors for growth hormone (GH), interleukins, erythropoietin, and prolactin. 

Other receptors and second messenger-binding proteins have intrinsic serine/threonine protein 
kinase activity. These include activin/TGF-p/BMP-supafamily receptors, calcium- and diacylglycerol- 
activated/phospholipid-d^endant protein kinase (PK-C), and RNA-dependant protein kinase (PK-R). 
In addition, other serine/threonine protein kinases, including nematode Twitchin, have fibronectin-like, 
immunoglobulin C2-like domains. 
G-Protein Coupled Receptors 

G-protein coupled receptors (GPCRs) are integral membrane proteins characterized by the 
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presence of seven hydrophobic transmembrane domains which span the plasma membrane and form a 
bundle of antiparaM alpha (a) helices. These proteins range in size from under 400 to over 1000 
amino acids (Strosberg, AD. (1991) Eur. J. Biochem. 196:1-10; Cougfflin, S.R. (1994) Curr. Opia 
Cell Biol. 6:191-197). The amino-terminus of the GPCR is extracellular, of variable length and often 
5 glycosylated; the carboxy-terminus is cytoplasmic and generally phosphorylated Extracellular loops of 
the GPCR alternate with intracellular loops and link the transmembrane domains. The most conserved 
domains of GPCRs are the transmembrane domains and the first two cytoplasmic loops. The 
transmembrane domains account for structural and functional features of the receptor. In most cases, 
the bundle of a helices forms a binding pocket. In addition, the extracellular N-terminal segment or one 

10 or more of the three extracellular loops may also participate in ligand binding. Ligand binding activates 
the receptor by inducing a conformational change in intracellular portions of the receptor. The 
activated receptor, in turn, interacts with an intracellular heterotrimeric guanine nucleotide binding (G) 
protean complex which mediates further intracellular signaling activities, generally the production of 
second messengers such as cyclic AMP (cAMP), phospholipase C, inositol triphosphate, or interactions 

15 with ion channel proteins (Baldwin, J.M. (1994) Curr. Opia Cell Biol. 6:180-190). 

GPCRs include those for acetylcholine, adenosine, epinephrine and norepinephrine, bombesin,' 
bradykinin, chemokines, dopamine, endothelin, y-aminobutyric acid (GAB A), follicle-stimulating 
hormone (FSH), glutamate, gonadotropin-releasing hormone (GnRH), hepatocyte growth factor, 
histamine, leukotrienes, melanocortins, neuropeptide Y, opioid peptides, opsins, prostanoids, serotonin, 

2 o somatostatin, tachykinins, thrombin, thyrotropin-releasing hormone (TRH), vasoactive intestinal 
polypeptide family, vasopressin and oxytocin, and orphan receptors. 

GPCR mutations, which may cause loss of function or constitutive activation, have been 
associated with numerous human diseases (Cougblin, supra) . For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene. Rhodopsin is the retinal photoreceptor which is located 

2 5 within the discs of the eye rod cell. Parma, J. et al. (1993, Nature 365 :649-65 1) report that somatic 

activating mutations in the thyrotropin receptor cause hyperfunctioning thyroid adenomas and suggest 
that certain GPCRs susceptible to constitutive activation may behave as protooncogenes. 
Nuclear Receptors 

Nuclear receptors bind small molecules such as hormones or second messengers, leading to 

3 o increased receptor-binding affinity to specific chromosomal DNA elements. In addition the affinity for 

other nuclear proteins may also be altered. Such binding and protein-protein interactions may regulate 
and modulate gene expression. Examples of such receptors include the steroid hormone receptors 
family, the retinoic acid receptors family, and the thyroid hormone receptors family. 
Ligand-Gated Rec eptor Inn Channels 
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Ligand-gated receptor ion channels fall into two categories. The first category, extracellular 
ligand-gated receptor ion channels (ELGs), rapidly transduce neurotransmitter-binding events into 
electrical signals, such as fast synaptic neurotransmission. ELG function is regulated by post- 
radiational modification. The second category, intracellular ligand-gated receptor ion channels 
5 (ILGs), are activated by many intracellular second messengers and do not require post-translational 
modification(s) to effect a channel-opening response. 

ELGs depolarize excitable cells to the threshold of action potential generation. In non-excitable 
cells, ELGs permit a limited calcium ion-influx during the presence of agonist. ELGs include channels 
directly gated by neurotransmitters such as acetylcholine, L-glutamate, glycine, ATP, serotonin, 

10 GAB A, and histamine. ELG genes encode proteins having strong structural and functional similarities. 
ILGs are encoded by distinct and unrelated gene families and include receptors for cAMP, cGMP, 
calcium ions, ATP, and metabolites of arachidoaic acid. 
Macrophage Scavenger Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding of 

is low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are trimeric 
membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
transmembrane domain, a large extracellular domain, and a C-terminal cystdne-rich domain: The 
extracellular domain contains a short spacer domain, an a-helical coiled-coil domain, and a triple helical 
collagenous domain These receptors have been shown to bind a spectrum of ligands, including 

2 o chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, and 
asbestos (Matsumoto, A. et al. (1990) Proc. Natl. Acad Sci. USA 87:9133-9137; Elomaa, 0. et al. 
(1995) Cell 80:603-609). The scavenger receptors are thought to play a key role in atherogenesis by 
mediating uptake of modified LDL in arterial walls, and in host defense by binding bacterial 
endotoxins, bacteria, and protozoa. 

25 T-Cell Receptors 

T cells play a dual role in the immune system as effectors and regulators, coupling antigen 
recognition with the transmission of signals that induce cell death in infected cells and stimulate 
proliferation of other immune ceils. Although a population of T cells can recognize a wide range of 
different antigens, an individual T cell can only recognize a singje antigen and only when it is presented 

30 to the T cell receptor (TCR) as a peptide complexed with a major histocompatibility molecule (MHC) 
on the surface of an antigen presenting cell. The TCR on most T cells consists of immunoglobulin-like 
integral membrane glycoproteins containing two polypeptide subunits, a and p, of similar molecular 
weight Both TCR subunits have an extracellular domain containing both variable and constant 
regions, a transmembrane domain that traverses the membrane once, and a short intracellular domain 
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(Saito, H. et al. (1984) Nature 309:757-762). The genes for the TCR subunits are constructed through 
somatic rearrangement of different gene segments. Interaction of antigen in the proper MHC context 
with the TCR initiates signaling cascades that induce the proliferation, maturation, and function of 
cellular components of the immune system (Weiss, A (1991) Annu. Rev. Genet 25:487-510). 
5 Rearrangements in TCR genes and alterations in TCR expression have been noted in lymphomas, 
leukemias, autoimmune disorders, and immunodeficiency disorders (Aisenberg, AC. et al. (1985) N. 
Engl. J. Med 313:529-533; Weiss, supra) . 

Intracellular Signaling Molecules 

10 Intracellular signaling is the general process by which cells respond to extracellular signals 

(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of a signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in the 
process involve the activation of various cytoplasmic proteins by phosphorylation via protein kinases, 

15 and their deactivation by protein phosphatases, and the eventual translocation of some of these 
activated proteins to the cell nucleus where the transcription of specific genes is triggered. The 
intracellular signaling process regulates all types of cell functions including cell proliferation, cell 
differentiation, and gene transcription, and involves a diversity of molecules including protein kinases 
and phosphatases, and second messenger molecules, such as cyclic nucleotides, calcium-calmodulin, 

2 o inositol, and various mitogens, that regulate protein phosphorylation. 

Protein Phosphorylation 

Protein kinases and phosphatases play a key role in the intracellular signaling process by 
controlling the phosphorylation and activation of various signaling proteins. The high energy 
phosphate for this reaction is generally transferred from the adenosine triphosphate molecule (ATP) to 
25 a particular protein by a protein kinase and removed from that protein by a protein phosphatase. 
Protein kinases are roughly divided into two groups: those that phosphorylate tyrosine residues 
(protein tyrosine kinases, PTK) and those that phosphorylate serine or threonine residues 
(serine/threonine kinases, STK). A few protein kinases have dual specificity for serine/threonine and 
tyrosine residues. Almost all kinases contain a conserved 250-300 amino acid catalytic domain 

3 o containing specific residues and sequence motifs characteristic of the kinase family (Hardie, G. and S. 

Hanks (1995) The Protein Kinase Facts Books , Vol 1:7-20, Academic Press, San Diego CA). 

STKs include the second messenger dependent protein kinases such as the cyclic- AMP 
dependent protein kinases (PKA), involved in mediating hormone-induced cellular responses; 
calcium-calmodulin (CaM) dependent protein kinases, involved in regulation of smooth muscle 
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contraction, glycogen breakdown, and neurotransmission; and the mitogen-activated protein kinases 
(MAP) which mediate signal transduction from the cell surface to the nucleus via phosphorylation 
cascades. Altered PKA expression is implicated in a variety of disorders and diseases including 
cancer, thyroid disorders, diabetes, atherosclerosis, and cardiovascular disease (Isselbacher, K. J. et al. 
5 (1994) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY, pp. 416-431, 1887). 

PTKs are divided into transmembrane, receptor PTKs and nontransmembrane, non-receptor 
PTKs. Transmembrane PTKs are receptors for most growth factors. Non-receptor PTKs lack 
transmembrane regions and, instead, form complexes with the intracellular regions of cell surface 
receptors. Receptors that function through non-receptor PTKs include those for cytokines and 

10 hormones (growth hormone and prolactin) and antigen-specific receptors on T and B lymphocytes. 
Many of these PTKs ware first identified as the products of mutant oncogenes in cancer cells in which 
their activation was no longer subject to normal cellular controls. In fact, about one third of the 
known oncogenes encode PTKs, and it is well known that cellular transformation (oncogenesis) is 
often accompanied by increased tyrosine phosphorylation activity (Charbonneau, H. and N.K. Tonks 

15 (1992) Annu. Rev. CeU Biol. 8:463-493). 

An additional family of protein kinases previously thought to exist only in procaryotes is the • 
histidine protein kinase family (HPK). HPKs bear little homology with mammalian STKs or PTKs 
but have distinctive sequence motifs of their own (Davie, J.R. et al. (1995) J. Biol. Chem. 
270: 19861-19867). A histidine residue in the N-terminal half of the molecule (region I) is an 

2 o autophosphorylation site. Three additional motifs located in the C-terminal half of the molecule 
include an invariant asparagine residue in region n and two glycine-rich loops characteristic of 
nucleotide binding domains in regions m and IV. Recently a branched chain alpha-ketoacid 
dehydrogenase kinase has been found with characteristics of HPK in rat (Davie, supra) . 

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups 

2 5 from molecules previously activated by kinases. The two principal categories of protein phosphatases 
are the protein (serine/threonine) phosphatases (PPs) and the protein tyrosine phosphatases (PTPs). 
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many 
cAMP-mediated hormone responses (Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508). PTPs 
reverse the effects of protein tyrosine kinases and play a significant role in cell cycle and cell 

30 signaling processes (Charbonneau. supra) . As previously noted, many PTKs are encoded by 

oncogenes, and oncogenesis is often accompanied by increased tyrosine phosphorylation activity. It 
is therefore possible that PTPs may prevent or reverse cell transformation and the growth of various 
cancers by controlling the levels of tyrosine phosphorylation in cells. This hypothesis is supported by 
studies showing that overexpression of PTPs can suppress transformation in cells, and that specific 
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inhibition of PTPs can enhance cell transformation (Charbonneau, supra) . 
Phospholipid and Inositol-Phosphate Signaling 

Inositol phospholipids (phosphoinositides) are involved in an intracellular signaling pathway 
that begins with binding of a signaling molecule to a G-protein linked receptor in the plasma 
membrane. This leads to the phosphorylation of phosphatidylinositol (PI) residues on the inner side 
of the plasma membrane to the biphosphate state (PIPj) by inositol kinases. Simultaneously, the G- 
protein linked receptor binding stimulates a trimeric G-protein which in turn activates a 
phosphoinositide-specific phospholipase C-p. Phospholipase C-P then cleaves PIP 2 into two 
products, inositol triphosphate (IP 3 ) and diacylglycerol. These two products act as mediators for 
separate signaling events. IP 3 diffuses through the plasma membrane to induce calcium release from 
the endoplasmic reticulum (ER), while diacylglycerol remains in the membrane and helps activate 
protein kinase C. an STK that phosphorylates selected proteins in the target cell. The calcium 
response initiated by IP 3 is terminated by the dephosphorylation of IP3 by specific inositol 
phosphatases. Cellular responses that are mediated by this pathway are glycogen breakdown in the 
liver in response to vasopressin, smooth muscle contraction in response to acetylcholine, and 
thrombin-induced platelet aggregation. 
Cyclic Nucleotide Signaling 

Cyclic nucleotides (cAMP and cGMP) function as intracellular second messengers to • 
transduce a variety of extracellular signals including hormones, light, and neurotransmitters. In 
particular, cyclic-AMP dependent protein kinases (PKA) are thought to account for all of the effects 
of cAMP in most mammalian cells, including various hormone-induced cellular responses. Visual 
excitation and the phototransmission of light signals in the eye is controlled by cyclic-GMP regulated, 
Ca^-specific channels. Because of the importance of cellular levels of cyclic nucleotides in 
mediating these various responses, regulating the synthesis and breakdown of cyclic nucleotides is an 
important matter. Thus adenylyl cyclase, which synthesizes cAMP from AMP, is activated to 
increase cAMP levels in muscle by binding of adrenaline to P-andrenergic receptors, while activation 
of guanylate cyclase and increased cGMP levels in photoreceptors leads to reopening of the 
Ca 2+ -specific channels and recovery of the dark state in the eye. In contrast, hydrolysis of cyclic 
nucleotides by cAMP and cGMP-specific phosphodiesterases (PDEs) produces the opposite of these 
and other effects mediated by increased cyclic nucleotide levels. PDEs appear to be particularly 
important in the regulation of cyclic nucleotides, considering the diversity found in this family of 
proteins. At least seven families of mammalian PDEs (PDE1-7) have been identified based on 
substrate specificity and affinity, sensitivity to cofactors, and sensitivity to inhibitory drugs (Beavo, 
J.A. (1995) Physiological Reviews 75:725-748). PDE inhibitors have been found to be particularly 
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useful in treating various clinical disorders. Rolipram, a specific inhibitor of PDE4, has been used in 
the treatment of depression, and similar inhibitors are undergoing evaluation as anti-inflammatory 
agents. Theophylline is a nonspecific PDE inhibitor used in the treatment of bronchial asthma and 
other respiratory diseases (Banner, K.H. and CP. Page (1995) Eur. Respir. J. 8:996-1000). 
5 G-Protein Signaling 

Guanine nucleotide binding proteins (G-proteins) are critical mediators of signal transduction 
between a particular class of extracellular receptors, the G-protein coupled receptors (GPCR), and 
intracellular second messengers such as cAMP and Ca 2+ . G-proteins are linked to the cytosolic side 
of a GPCR such that activation of the GPCR by ligand binding stimulates binding of the G-protein to 

1 o GTP, inducing an " active" state in the G-protein. In the active state, the G-protein acts as a signal to 

trigger other events in the cell such as the increase of cAMP levels or the release of Ca 2+ into the 
cytosol from the ER, which, in turn, regulate phosphorylation and activation of other intracellular 
proteins. Recycling of the G-protein to the inactive state involves hydrolysis of the bound GTP to 
GDP by a GTPase activity in the G-protein. (See Alberts, B. et aL (1994) Molecular Biology of the 
15 Cell, Garland Publishing, Inc., New York NY, pp.734-759.) Two structurally distinct classes of G- 
proteins are recognized: heterotrimeric G-proteins, consisting of three different subunits, and 
monomeric, low molecular weight (LMW), G-proteins consisting of a single polypeptide chain. 

The three polypeptide subunits of heterotrimeric G-proteins are the a, J3, and y subunits. The 
a subunit binds and hydrolyzes GTP. The P and y subunits form a tight complex that anchors the 

2 o protein to the inner side of the plasma membrane. The P subunits, also known as G- P proteins or P 

transducins, contain seven tandem repeats of the WD-repeat sequence motif, a motif found in many 
proteins with regulatory functions. Mutations and variant expression of p transdudn proteins are 
linked with various disorders (Neer, E.J. et al. (1994) Nature 371:297-300; Margottin, F. et al. (1998) 
Mol. Cell 1:565-574). 

2 5 LMW GTP-proteins are GTPases which regulate cell growth, cell cycle control, protein 

secretion, and intracellular vesicle interaction. They consist of single polypeptides which, like the a 
subunit of the heterotrimeric G-proteins, are able to bind and hydrolyze GTP, thus cycling between an 
inactive and an active state. At least sixty members of the LMW G-protein superf amily have been 
identified and are currently grouped into the six subfamilies of ras, rho, arf , sari , ran, and rab. 

3 o Activated ras genes were initially found in human cancers, and subsequent studies confirmed that ras 

function is critical in determining whether cells continue to grow or become differentiated. Other 
members of the LMW G-protein superf amily have roles in signal transduction that vary with the 
function of the activated genes and the locations of the G-proteins. 

Guanine nucleotide exchange factors regulate the activities of LMW G-proteins by 
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determining whether GTP or GDP is bound. GTPase-activating protein (GAP) binds to GTP-ras and 
induces it to hydrolyze GTP to GDP. In contrast, guanine nucleotide releasing protein (GNRP) binds 
to GDP-ras and induces the release of GDP and the binding of GTP. 

Other regulators of G-protein signaling (RGS) also exist that act primarily by negatively 
5 regulating the G-protein pathway by an unknown mechanism (Druey, K.M et al. (1996) Nature 
379:742-746). Some 15 members of the RGS family have been identified. RGS family members are 
related structurally through similarities in an approximately 120 amino add region termed the RGS 
domain and functionally by their ability to inhibit the interleukin (cytokine) induction of MAP kinase 
in cultured mammalian 293T cells (Druey, supra) . 

10 Calcium Signaling Molecules 

Ca +2 is another second messenger molecule that is even more widely used as an intracellular 
mediator than cAMP. Two pathways exist by which Ca +2 can enter the cytosol in response to 
extracellular signals: One pathway acts primarily in nerve signal transduction where Ca* 1 enters a 
nerve terminal through a voltage-gated Ca +2 channel. The second is a more ubiquitous pathway in 

15 which Ca +2 is released from the ER into the cytosol in response to binding of an extracellular 

signaling molecule to a receptor. Ca 2 * directly activates regulatory enzymes, such as protein kinase C, 
which trigger signal transduction pathways. Ca 2+ also binds to specific Ca^-binding proteins (CBPs) 
such as calmodulin (CaM) which then activate multiple target proteins in the cell including enzymes, 
membrane transport pumps, and ion channels. CaM interactions are involved in a multitude of cellular 

20 processes including, but not limited to, gene regulation, DNA synthesis, cell cycle progression, 
mitosis, cytokinesis, cytoskeletal organization, muscle contraction, signal transduction, ion 
homeostasis, exocytosis, and metabolic regulation (Celio, M.R. et al. (1996) Guidebook to 
Calcium-binding Proteins . Oxford University Press, Oxford, UK, pp. 15-20). Some CBPs can serve 
as a storage depot for Ca 2 * in an inactive state. Calsequestrin is one such CBP that is expressed in 

2 5 isof orms specific to cardiac muscle and skeletal muscle. It is suggested that calsequestrin binds Ca 2+ 

in a rapidly exchangeable state that is released during Ca 2 * -signaling conditions (Celio, M.R. et al. 
(1996) Guidebook to Calcium-binding Proteins , Oxford University Press, New York NY, pp. 222- 
224). 
Cvclins 

3 o Cell division is the fundamental process by which all living things grow and reproduce. In 

most organisms, the cell cycle consists of three principle steps; interphase, mitosis, and cytokinesis. 
Interphase, involves preparations for cell division, replication of the DNA and production of essential 
proteins. In mitosis, the nuclear material is divided and separates to opposite sides of the cell. 
Cytokinesis is the final division and fission of the cell cytoplasm to produce the daughter cells. 
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The entry and exit of a cell from mitosis is regulated by the synthesis and destruction of a 
family of activating proteins called cyclins. Cyclins act by binding to and activating a group of 
cyclin-dependent protein kinases (Cdks) which then phosphorylate and activate selected proteins 
involved in the mitotic process. Several types of cyclins exist. (Ciechanover, A. (1994) Cell 
5 79:13-21.) Two principle types are mitotic cyclin, or cyclin B, which controls entry of the cell into 
mitosis, and Gl cyclin, which controls events that drive the ceil out of mitosis. 
Signal Complex Scaffolding Proteins 

Ceretain proteins in intracellular signaling pathways serve to link or cluster other proteins 
involved in the signaling cascade. A conserved protein domain called the PDZ domain has been 

1 o identified in various membrane-associated signaling proteins. This domain has been implicated in 

receptor and ion channel clustering and in the targeting of multiprotein signaling complexes to 
specialized functional regions of the eytesolie face of the plasma membrane. (For a review of PDZ 
domain-containing proteins, see Ponting, CP. et al. (1997) Bioessays 19:469-479.) A large 
proportion of PDZ domains are found in the eukaryotic MAGUK (membrane-associated guanylate 

15 kinase) protein family, members of which bind to the intracellular domains of receptors and channels. 
However, PDZ domains are also found in diverse membrane-localized proteins such as protein 
tyrosine phosphatases, serine/threonine kinases, G-protein cof actors, and synapse-associated proteins 
such as syntrophins and neuronal nitric oxide synthase (nNOS). Generally, about one to three PDZ 
domains are found in a given protein, although up to nine PDZ domains have been identified in a 

20 single protein. 

Membrane Transport Molecules 

Hie plasma membrane acts as a barrier to most molecules. Transport between the cytoplasm 
and the extracellular environment, and between the cytoplasm and lumenal spaces of cellular 

2 5 organelles requires specific transport proteins. Each transport protein carries a particular class of 

molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular species of 
the class. A variety of human inherited diseases are caused by a mutation in a transport protein. For 
example, cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-linked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 

3 o urine leads to the formation of cystine stones in the kidneys. 

Transport proteins are multi-pass transmembrane proteins, which either actively transport 
molecules across the membrane or passively allow them to cross. Active transport involves 
directional pumping of a solute across the membrane, usually against an electrochemical gradient 
Active transport is tightly coupled to a source of metabolic energy , such as ATP hydrolysis or an 
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electrochemically favorable ion gradient Passive transport involves the movement of a solute down 
its electrochemical gradient Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a specific 
solute to be transported and undergo a conformational change which transfers the bound solute across 
5 the membrane. Channel proteins, which only function in passive transport, form hydrophilic pores 
across the membrane. When the pores open, specific solutes, such as inorganic ions, pass through the 
membrane and down the electrochemical gradient of the solute. 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

1 o simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 

opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. Hie 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
1 5 Na7K + ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), 
iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve 
putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-orientedN- 
and C-tennini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of various thyroid 
pathologies because it is the molecular basis for radioiodide thyroid-imaging techniques and for 

2 o specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. Natl. Acad. Sci. 

USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta, and is 
implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate (Prasad, P.D. et 
al. (1998) J. Biol. Chem. 273:7501-7506). 

Transporters play a major role in the regulation of pH, exaction of drugs, and the cellular 

2 5 K + /Na + balance. Monocarboxylate anion transporters are proton-coupled symporters with a broad 

substrate specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted 
to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and 
TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced 

3 o stoichiometrically with lactate during glycolysis. Tbt best characterized H(+)-monocarboxylate 

transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess H(+)-linked monocarboxylate transporters with 
differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have 
transporters that differ in their values for certain substrates, including stereoselectivity for L- over 
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D-lactate, and in their sensitivity to inhibitors. There are Na(+)-monocarboxylate cotransporters on the 
luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate* and 
ketone bodies in these tissues. In addition, there are specific and selective transporters for organic 
cations and organic anions in organs including the kidney, intestine and liver. Organic anion 
transporters are selective for hydrophobic, charged molecules with electron-attracting side groups. 
Organic cation transporters, such as the ammonium transporter, mediate the secretion of a variety of 
drugs and endogenous metabolites, and contribute to the maintenance of intercellular pH. (Poole, R.C. 
and A.P. Halestrap (1993) Am. J. PhysioL 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 
329:321-328; and Martindle, K. and I. Haggstrom (1993) J. BiotechnoL 30: 339-350.) 

The largest and most diverse family of transport proteins known is the ATP-binding cassette 
(ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 
chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the case 
for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These << half-mole£ules" form 
homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 
histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in 
ABC transporters, such as the following diseases and thdr corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrqphy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 
(sulfonylurea receptor, SUR). Overexpressian of the multidrug resistance (MDR) protein, another 
ABC transporter, in human cancer ceils makes the cells resistant to a variety of cytotoxic drugs used in 
chemotherapy (Taglight, D. and S. Michaelis (1998) Meth. Enzymol. 292:131-163). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. 
Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, 
is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as muscle, heart, 
and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and 
expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et al. (1998) J. 
Biol. Chem. 273:27420-27429). 
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Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form an ion- 
selective pore within the membrane. There are two basic types of ion channels, ion transporters and 
5 gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an 
ion down the ion* s electrochemical gradient under restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 
conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 

1 o gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ -ATPase, and H + -ATPase, are activated by a 
15 phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 

distributions such that cytosolic concentrations of Na + and Ca 2 * are low and cytosolic concentration of 
K* is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organdies, 
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within 
the lumen of these organelles that is required for function. The coupling factor (F) class consists of H + 

2 o pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 

ADP and inorganic phosphate (Pj). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 

gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 

the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
2 5 (symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 

molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of the cell 

with transport of Na + into the cell (antiport). 

Ion channels share common structural and mechanistic themes. The channel consists of four or 

five subunits or protein monomers that are arranged like a barrel in the plasma membrane. Each 
30 subunit typically consists of six potential transmembrane segments (SI, S2, S3, S4, S5, and S6). The 

center of the barrel forms a pore lined by a-helices or p-strands. The side chains of the amino acid 

residues comprising the a-helices or p-strands establish the charge (cation or anion) selectivity of the 

channel. The degree of selectivity, or what specific ions are allowed to pass through the channel, 

depends on the diameter of the narrowest part of the pore. 
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Gated ion channels control ion flow by regulating the opening and closing of pores. These 
channels are categorized according to the manna: of regulating the gating function Mechanically-gated 
channels open pores in response to mechanical stress, voltage-gated channels open pores in response to 
changes in membrane potential, and ligand-gated channels open pores in the presence of a specific ion, 
5 nucleotide, or neurotransmitter. 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitable cells, 
such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle 
contraction, arise from large, transient changes in the permeability of the membrane to Na + and K + ions. 
Depolarization of the membrane beyond the threshold level opens voltage-gated Na + channels. Sodium 

1 o ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na + 

channels, which propagates the depolarization down the length of the ceil. Depolarization also opens 
voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to 
repolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth 
transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at 
1 5 which time the channel spontaneously converts into an inactive state that cannot be opened irrespective 
of the membrane potential. Inactivation is mediated by the channel's N-tenninus, which acts as a plug 
that closes the pore. The transition from an inactive to a closed state requires a return to resting 
potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore forming 
20 a subunit that associates with two smaller auxiliary subunits, pi and 02. The |J2 subunit is an integral 
membrane glycoprotein that contains an extracellular Ig domain, and its association with a and pi 
subunits correlates with increased functional expression of the channel, a change in its gating 
properties, and an increase in whole cell capacitance due to an increase in membrane surface area. 
(Isom, L.L. et al. (1995) Cell 83:433-442.) 

2 5 Voltage-gated Ca 2 * channels are involved in presynaptic neurotransmitter release, and heart 

and skeletal muscle contraction. The voltage-gated Ca 2 * channels from skeletal muscle (L-type) and 
brain (N-type) have been purified, and though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The a x subunit forms the 
membrane pore and voltage sensor, while the and p subunits modulate the voltage-dependence, 

3 o gating properties, and the current amplitude of the channel. These subunits are encoded by at least six 

o lf one 028, and four p genes. A fourth subunit, y, has been identified in skeletal muscle. (Walker, D. 
et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:490-492.) 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 
organelle pH. In secretory epithelial cells, CI* enters the cell across a basolateral membrane through an 

69 



WO 01/62927 



PCT/US01/06059 



Na + , KVC1 * cotransporter, accumulating in the cell above its dectrochemical equilibrium concentratioa 
Secretion of CI " from the apical surface* in response to hormonal stimulation, leads to flow of Na + and 
water into the secretory lumen The cystic fibrosis transmembrane conductance regulator (CFTR) is a 
chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 
5 Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The 
resulting blockage of these sites leads to pancreatic insirificiency, ''meconium ileus", and devastating 
"chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

Many intracellular organelles contain H + -ATPase pumps that generate transmembrane pH and 

1 o electrochemical differences by moving protons from the cytosol to the organelle lumen. If the 

membrane of the organelle is permeable to other ions, then the electrochemical gradient can be 
abrogated without affecting the pH dirTereaiiial. In fact, removal of die dectrochemicai barrier allows 
more H + to be pumped across the membrane, increasing the pH differential. CI* is the sole counterion 
of H + translocation in a number of organelles, including chromaffin granules, Golgi vesicles, 
1 5 lysosomes, and endosomes. Functions that require a low vacuolar pH include uptake of small 

molecules such as biogenic amines in chromaffin granules, processing of vacuolar constituents such as 
pro-hormones by proteolytic enzymes, and protein degradation in lysosomes (Al-Awqati, supra) . 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds to 

2 o their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle cells. 

There are two types of neurotransmitter-gated channels. Sodium channels open in response to 
excitatory neurotransmitters, such as acetylcholine, glutamate > and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 
and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, such 

25 as y-aminobutyric acid (GAB A) and glycine, leading to hyperpolarization of the membrane and the 
subsequent generation of an action potential. 

Ligand-gated channels can be regulated by intracellular second messengers. Calcium-activated 
K + channels are gated by internal calcium ions. In nerve cells, an influx of calcium during 
depolarization opens K + channels to modulate the magnitude of the action potential (Ishi, T.M. et al. 

30 (1997) Proc. Natl. Acad Sci. USA 94:1 1651-1 1656). Cyclic nucleotide-gated (CNG) channels are 
gated by cytosolic cyclic nucleotides. The best examples of these are the cAMP-gated Na* channels 
involved in olfaction and the cGMP-gated cation channels involved in vision. Both systems involve 
ligand-mediated activation of a G-protein coupled receptor which thai alters the level of cyclic 
nucleotide within the cell. 
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Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, are 
also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and braia Calcium-activated K + 
channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for shunting 
5 excess K + from brain capillary endothelial cells into the blood. They are also implicated in repolarizing 
granulocytes after agonist-stimulated depolarization (Ishi, supra) : Ion channels have been the target for 
many drug therapies. Neurotransmitter-gated channels have been targeted in therapies for treatment of 
insomnia, anxiety, depression, and schizophrenia. Voltage-gated channels have been targeted in 
therapies for arrhythmia, ischemic stroke, head trauma, and neurodegenerative disease (Taylor, CP. 
10 and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g. cystic fibrosis, glucose-galactose 
1 5 malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mdlitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e,g., cystinuria, iminogjycinuria, Hartup disease, and Fanconi disease (vant Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480). 

20 

Protein Modification and Maintenance Molecules 

The cellular processes regulating modification and maintenance of protein molecules 
coordinate their conformation, stabilization, and degradation. Each of these processes is mediated by 
key enzymes or proteins such as proteases, protease inhibitors, transferases, isomerases, and 

2 5 molecular chaperones. 

Proteases 

Proteases cleave proteins and peptides at the peptide bond that forms the backbone of the 
peptide and protein chain. Proteolytic processing is essential to cell growth, differentiation, 
remodeling, and homeostasis as well as inflammation and immune response. Typical protein half- 

3 o lives range from hours to a few days, so that within all living cells, precursor proteins are being 

cleaved to their active f onn, signal sequences proteolytically removed from targeted proteins, and 
aged or defective proteins degraded by proteolysis. Proteases function in bacterial, parasitic, and viral 
invasion and replication within a host. Four principal categories of mammalian proteases have been 
identified based on active site structure, mechanism of action, and overall three-dimensional structure. 
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(Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University 
Press, New York NY, pp. 1-5). 

Hie serine proteases (SPs) have a serine residue, usually within a conserved sequence, in an 
active site composed of the serine, an aspartate, and a histidine residue. SPs include the digestive 
enzymes trypsin and chymotrypsin, components of the complement cascade and the blood-clotting 
cascade, and enzymes that control extracellular protein degradation. The main SP sub-families are 
trypases, which cleave after arginine or lysine; aspartases, which cleave after aspartate; chymases, 
which cleave after phenylalanine or leucine; metases, which cleavage after methionine; and serases 
which cleave after serine. Enterokinase, the initiator of intestinal digestion, is a serine protease found 
in the intestinal brush border, where it cleaves the acidic propeptide from trypsinogen to yield active 
trypsin (Kitamoto, Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91:7588-7592). 
Procarboxypeptidase, a lysosomal serine peptidase that cleaves peptides such as angiotensin H and 
HI and [des-Arg9] bradykinin, shares sequence homology with members of both the serine 
carboxypeptidase andprolylendopeptidase families (Tan, F. et al. (1993) J. Biol. Chem. 268:16631- 
16638). 

Cysteine proteases (CPs) have a cysteine as the major catalytic residue at an active site where 
catalysis proceeds via an intermediate thiol ester and is facilitated by adjacent histidine and aspartic 
acid residues. CPs are involved in diverse cellular processes ranging from the processing of precursor 
proteins to intracellular degradation. Mammalian CPs include lysosomal cathepsins and cytosolic 
calcium activated proteases, calpains. CPs are produced by monocytes, macrophages and other cells 
of the immune system which migrate to sites of inflammation and secrete molecules involved in 
tissue repair. Overabundance of these repair molecules plays a role in certain disorders. In 
autoimmune diseases such as rheumatoid arthritis, secretion of the cysteine peptidase cathepsin C 
degrades collagen, laminin, elastin and other structural proteins found in the extracellular matrix of 
bones. 

Aspartic proteases are members of the cathepsin family of lysosomal proteases and include 
pepsin A, gastricsin, chymosin, renin, and cathepsins D and E. Aspartic proteases have a pair of 
aspartic acid residues in the active site> and are most active in the pH 2 - 3 range, in which one of the 
aspartate residues is ionized, the other un-ionized. Aspartic proteases include bacterial 
penicillopepsin, mammalian pepsin, renin, chymosin, and certain fungal proteases. Abnormal 
regulation and expression of cathepsins is evident in various inflammatory disease states. In cells 
isolated from inflamed synovia, the mRNA for stromdysin, cytokines, TIMP-1, cathepsin, gelatinase, 
and other molecules is preferentially expressed Expression of cathepsins L and D is elevated in 
synovial tissues from patients with rheumatoid arthritis and osteoarthritis. Cathepsin L expression may 
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also contribute to the influx of mononuclear cells which exacerbates the destruction of the rheumatoid 
synovium. (Keyszer, G.M. (1995) Arthritis Rheum. 38:976-984.) The increased expression and 
differential regulation of the cathepsins are linked to the metastatic potential of a variety of cancers and 
as such are of therapeutic and prognostic interest (Chambers, A.F. et aL (1993) Crit Rev. Oncog. 
4:95-114). 

Metalloproteases have active sites that include two glutamic acid residues and one histidine 
residue that serve as binding sites for zinc. Carboxypeptidases A and B are the principal mammalian 
metalloproteases. Both are exoproteases of similar structure and active sites. Carboxypeptidase A, 
like chymotrypsin, prefers C-terminal aromatic and aliphatic side chains of hydrophobic nature, 
whereas carboxypeptidase B is directed toward basic arginine and lysine residues. Glycoprotease 
(GCP), or O-sialoglycoprotein endopeptidase, is a metallopeptidase which specifically cleaves 
O-sialoglycoproteins such as glycophoiin A. Another metallopeptidase, placental leucine 
aminopeptidase (P-LAP) degrades several peptide hormones such as oxytocin and vasopressin, 
suggesting a role in maintaining homeostasis during pregnancy, and is expressed in several tissues 
(Rogi, T. et al. (1996) J. Biol. Chem. 271:56-61). 

Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria: Hie UCS 
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the • 
UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable ' 
protein. Hie ubiquitinated protein is then recognized and degraded by proteasome, a large, 
multisubunit proteolytic enzyme complex, and ubiquitin is released for reutilization by ubiquitin 
protease. The UCS is implicated in the degradation of mitotic cyclic kinases, oncoproteins, tumor 
suppressor genes such as p53, viral proteins, cell surface receptors associated with signal 
transduction, transcriptional regulators, and mutated or damaged proteins (Ciechanover, A. (1994) 
Cell 79: 1 3-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin protease whose 
overexpression leads to oncogenic transformation of NIH3T3 cells, and the human homolog of this 
gene is consistently elevated in small cell tumors and adenocarcinomas of the lung (Gray, D.A. 
(1995) Oncogene 10:2179-2183). 
Signal Peptidases 

The mechanism for the translocation process into the endoplasmic reticulum (ER) involves 
the recognition of an N-terminal signal peptide on the elongating protein. Hie signal peptide directs 
the protein and attached ribosome to a receptor on the ER membrane. The polypeptide chain passes 
through a pore in the ER membrane into the lumen while the N-terminal signal peptide remains 
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attached at the membrane surface. The process is completed when signal peptidase located inside the 
ER cleaves the signal peptide from the protein and releases the protein into the lumen. 
Protease Inhibitors 

Protease inhibitors and other regulators of protease activity control the activity and effects of 
proteases. Protease inhibitors have been shown to control pathogenesis in animal models of 
proteolytic disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). Low levels of the 
cystatins, low molecular weight inhibitors of the cysteine proteases, correlate with malignant 
progression of tumors. (Calkins, C. et al (1995) Biol. Biochem. Hoppe Seyler 376:71-80). Serpins 
are inhibitors of mammalian plasma serine proteases. Many serpins serve to regulate the blood 
clotting cascade and/or the complement cascade in mammals. Sp32 is a positive regulator of the 
mammalian acrosomal protease, acrosin, that binds the proenzyme, proacrosin, and thereby aides in 
packaging the enzyme into the acrosomal matrix (Baba, T. et al (1994) X Biol. Chem. 269:10133- - 
10140). The Kunitz family of serine protease inhibitors are characterized by one or more "Kunitz 
domains" containing a series of cysteine residues that are regularly spaced over approximately 50 
amino acid residues and form three intrachain disulfide bonds. Members of this family include 
aprotinin, tissue factor pathway inhibitor (TFPI-1 and TFPI-2), inter-a-trypsin inhibitor, and bikunin. 
(Marlor, C.W. et al. (1997) J. Biol. Chem. 272:12202-12208.) Members of this family are potent 
inhibitors (in the nanomolar range) against serine proteases such as kallikredn and plasmin Aprotinin 
has clinical utility in reduction of perioperative blood loss. 

A major portion of all proteins synthesized in eukaryotic cells are synthesized on the 
cytosolic surface of the endoplasmic reticulum (ER). Before these immature proteins are distributed 
to other organelles in the cell or are secreted, they must be transported into the interior lumen of the 
ER where post-translational modifications are performed. These modifications include protein folding 
and the formation of disulfide bonds, and N-linked gjycosylations. 
Protein Isomerases 

Protein folding in the ER is aided by two principal types of protein isomerases, protein 
disulfide isomerase (PDI), and peptidyl-prolyl isomerase (PPI). PDI catalyzes the oxidation of free 
sulfhydryl groups in cysteine residues to form intramolecular disulfide bonds in proteins. PPI, an 
enzyme that catalyzes the isomerization of certain proline imidic bonds in oligopeptides and proteins, 
is considered to govern one of the rate limiting steps in the folding of many proteins to their final 
functional conformation. Hie cyclophilins represent a major class of PPI that was originally 
identified as the major receptor for the immunosuppressive drug cyclosporin A (Handschumacher, 
R.E. et al. (1984) Science 226: 544-547). 
Protein Glvcosvlation 
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Hie glycosylatioa of most soluble secreted and membrane-bound proteins by 
oligosaccharides linked to asparagine residues in proteins is also performed in the ER. This reaction 
is catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. Although the exact purpose 
of this "N-linked" glycosylation is unknown, the presence of oligosaccharides tends to make a 
glycoprotein resistant to protease digestion. In addition, oligosaccharides attached to cell-surface 
proteins called selectins are known to function in cell-cell adhesion processes (Alberts, B. et al. 
(1994) Molecular Biology of the Cell, Garland Publishing Co., New York NY, p.608). "O-linked" 
glycosylation of proteins also occurs in the ER by the addition of N-acetylgalactosamine to the 
hydroxyl group of a serine or threonine residue followed by the sequential addition of other sugar 
residues to the first. This process is catalysed by a series of glycosyltransferases each specific for a 
particular donor sugar nucleotide and acceptor molecule (Lodish, H. et aL (1995) Molecular Cell 
Biology . W.H. Freeman and Co., New York NY, pp.700-708). In many cases, both N- and O-linked 
oligosaccharides appear to be required for the secretion of proteins or the movement of plasma 
membrane glycoproteins to the cell surface. 

An additional glycosylation mechanism operates in the ER specifically to target lysosomal 
enzymes to ly sosomes and prevent their secretion. Lysosomal enzymes in the ER receive an N-linked 
oligosaccharide, like plasma membrane and secreted proteins, but are then phosphorylated on one or 
two mannose residues. The phosphorylation of mannose residues occurs in two steps, the first step 
being the addition of an N-acetylglucosamine phosphate residue by N-acetylglucosamine 
phosphotransferase, and the second the removal of the N-acetylglucosamine group by 
phosphodiesterase. The phosphorylated mannose residue then targets the lysosomal enzyme to a 
mannose 6-phosphate receptor which transports it to alysosome vesicle (Lodish, supra, pp. 708-711). 
Chaperones 

Molecular chaperones are proteins that aid in the proper folding of immature proteins and 
refolding of improperly folded ones, the assembly of protein subunits, and in the transport of 
unfolded proteins across membranes. Chaperones are also called heat-shock proteins (hsp) because of 
their tendency to be expressed in dramatically increased amounts following brief exposure of cells to 
elevated temperatures. This latter property most likely reflects their need in the refolding of proteins 
that have become denatured by the high temperatures. Chaperones may be divided into several 
classes according to their location, function, and molecular weight, and include hsp60, TCP1, hsp70, 
hsp40 (also called DnaJ), and hsp90. For example, hsp90 binds to steroid hormone receptors, 
represses transcription in the absence of the ligand, and provides proper folding of the ligand-binding 
domain of the receptor in the presence of the hormone (Burston, S.G. and A.R. Clarke (1995) Essays 
Biochem. 29: 125-136). Hsp60 and hsp70 chaperones aid in the transport and folding of newly 



75 



WO 01/62927 



PCTYUS01/06059 



synthesized proteins. Hsp70 acts early in protein folding, binding a newly synthesized protein before 
it leaves the ribosome and transporting the protein to the mitochondria or ER before releasing the 
folded protein. Hsp60, along with hsplO, binds misfolded proteins and gives them the opportunity to 
refold correctly. All chaperones share an affinity for hydrophobic patches on incompletely folded 
proteins and the ability to hydrolyze ATP. The energy of ATP hydrolysis is used to release the hsp- 
bound protein in its properly folded state (Alberts, supra , pp 214, 571-572). 

Nucleic Acid Synthesis and Modification Molecules 

Polymerases 

DNA and RNA replication are critical processes for cell replication and function. DNA and 
RN A replication are mediated by the enzymes DNA and RNA polymerase, respectively, by a 
"templatiiig" process in which the nucleotide sequence of a DNA or RNA strand is copied by 
complementary base-pairing into a complementary nucleic acid sequence of either DNA or RNA. 
However, there are fundamental' differences between the two processes. 

DNA polymerase catalyzes the stepwise addition of a deoxyribonucleotide to the 3'-OH end 
of a polynucleotide strand (the primer strand) that is paired to a second (template) strand. The new 
DNA strand therefore grows in the 5' to 3* direction (Alberts, B.etal. (1994) The Molecular Biology 
of the Cell , Garland Publishing Inc., New York NY, pp. 25 1-254). The substrates for the 
polymerization reaction are the corresponding deoxynucleotide triphosphates which must base-pair 
with the correct nucleotide on the template strand in order to be recognized by the polymerase. 
Because DNA exists as a double-stranded helix, each of the two strands may serve as a template for 
the formation of a new complementary strand. Each of the two daughter cells of the dividing cell 
therefore inherits a new DNA double helix containing one old and one new strand. Thus, DNA is 
said to be replicated "semiconservatively " by DNA polymerase. In addition to the synthesis of new 
DNA, DNA polymerase is also involved in the repair of damaged DNA as discussed below under 
"Ligases." 

In contrast to DNA polymerase, RNA polymerase uses a DNA template strand to "transcribe" 
DNA into RNA using ribonucleotide triphosphates as substrates. Like DNA polymerization, RNA 
polymerization proceeds in a 5' to 3' direction by addition of a ribonucleoside monophosphate to the 
3'-OH end of a growing RNA chain. DNA transcription generates messenger RNAs (mRNA) that 
carry information for protein synthesis, as well as the transfer, ribosomal, and other RNAs that have 
structural or catalytic functions. In eukaryotes, three discrete RNA polymerases synthesize the three 
different types of RNA (Alberts, supra , pp. 367-368). RNA polymerase I makes the large ribosomal 
RNAs, RNA polymerase II makes the mRNAs that will be translated into proteins, and RNA 



76 



WO 01/62927 



PCT/US01/06059 



polymerase III makes a variety of small, stable RNAs, including 5S ribosomal RNA and the transfer 
RNAs (tRNA). In all cases, RNA synthesis is initiated by binding of the RNA polymerase to a 
promoter region on the DNA and synthesis begins at a start site within the promoter. Synthesis is 
completed at a broad, general stop or termination region in the DNA where both the polymerase and 
5 the completed RNA chain are released 
Ligases 

DNA repair is the process by which accidental base changes, such as those produced by 
oxidative damage, hydrolytic attack, or uncontrolled methylation of DNA are corrected before 
replication or transcription of the DNA can occur. Because of the efficiency of the DNA repair 

1 o process, fewer than one in one thousand accidental base changes causes a mutation (Alberts, supra , 

pp. 245-249). Hie three steps common to most types of DNA repair are (1) excision of the damaged 
or altered base or nucleotide by DNA nucleases* leaving a gap; (2) insertion of the correct nucleotide 
in this gap by DNA polymerase using the complementary strand as the template; and (3) sealing the 
break left between the inserted nucleotide(s) and the existing DNA strand by DNA ligase. In the last 
1 5 reaction, DNA ligase uses the energy from ATP hydrolysis to activate the 5 1 end of the broken 
phosphodiester bond before forming the new bond with the 3'-OH of the DNA strand. In Bloom's 
syndrome, an inherited human disease, individuals are partially deficient in DNA ligation and 
consequently have an increased incidence of cancer (Alberts, supra , p. 247). 
Nucleases 

2 o Nucleases comprise both enzymes that hydrolyze DNA (DNase) and RNA (RNase). They 

serve different purposes in nucleic acid metabolism. Nucleases hydrolyze the phosphodiester bonds 
between adjacent nucleotides either at internal positions (endonucleases) or at the terminal 3' or 5 1 
nucleotide positions (exonucleases). A DNA exonuclease activity in DNA polymerase, for example, 
serves to remove improperly paired nucleotides attached to the 3'-OH end of the growing DNA strand 

25 by the polymerase and thereby serves a "proofreading" function. As mentioned above, DNA 
endonuclease activity is involved in the excision step of the DNA repair process. 

RNases also serve a variety of functions. For example, RNase P is a ribonucleoprotem 
enzyme which cleaves the 5' end of pre-tRNAs as part of their maturation process. RNase H digests 
the RNA strand of an RNA/DNA hybrid. Such hybrids occur in cells invaded by retroviruses, and 

30 RNase H is an important enzyme in the retroviral replication cycle. Pancreatic RNase secreted by the 
pancreas into the intestine hydrolyzes RNA present in ingested foods. RNase activity in serum and 
cell extracts is elevated in a variety of cancers and infectious diseases (Schein, C.H. (1997) Nat 
Biotechnol. 15:529-536). Regulation of RNase activity is being investigated as a means to control 
tumor angiogenesis, allergic reactions, viral infection and replication, and fungal infections. 
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Methvlases 

Methylation of specific nucleotides occurs in both DNA and RNA, and serves different 
functions in the two macromolecules. Methylation of cytosine residues to form 5 -methyl cytosine in 
DNA occurs specifically at CG sequences which are base-paired with one another in the DNA double- 
5 helix. This pattern of methylation is passed from generation to generation during DNA replication by 
an enzyme called "maintenance methylase" that acts preferentially on those CG sequences that are 
base-paired with a CG sequence that is already methylated. Such methylation appears to distinguish 
active from inactive genes by preventing the binding of regulatory proteins that "turn on" the gene, 
but permit the binding of proteins that inactivate the gene (Alberts, supra , pp. 448-451). In RNA 

1 o metabolism, "tRNA methylase" produces one of several nucleotide modifications in tRN A that affect 

the conformation and base-pairing of the molecule and facilitate the recognition of the appropriate 
mRNA codons by specific tRNAs. The primary methylation pattern is the dimcthyiation of guanine 
residues to form N,N-dimethyl guanine. 
Helicases and Single-Stranded Binding Proteins 
1 5 Helicases are enzymes that destabilize and unwind double helix structures in both DNA and 

RNA. . Since DNA replication occurs more or less simultaneously on both strands, the two strands 
must first separate to generate a replication "fork" for DNA polymerase to act on. Two types of 
replication proteins contribute to this process, DNA helicases and single-stranded binding proteins. 
DNA helicases hydrolyze ATP and use the energy of hydrolysis to separate the DNA strands. Single- 

2 o stranded binding proteins (SSBs) then bind to the exposed DNA strands without covering the bases, 

thereby temporarily stabilizing them for templating by the DNA polymerase (Alberts, supra , pp. 255- 
256). 

RNA helicases also alter and regulate RNA conformation and secondary structure. Like the 
DNA helicases, RNA helicases utilize energy derived from ATP hydrolysis to destabilize and unwind 

2 5 RNA duplexes. The most well-characterized and ubiquitous family of RNA helicases is the DEAD- 

box family, so named for the conserved B-type ATP-binding motif which is diagnostic of proteins in 
this family. Over 40 DEAD-box helicases have been identified in organisms as diverse as bacteria, 
insects, yeast, amphibians, mammals, and plants. DEAD-box helicases function in diverse processes 
such as translation initiation, splicing, ribosome assembly, and RNA editing, transport, and stability. 

3 o Some DEAD-box helicases play tissue- and stage-specific roles in spermatogenesis and 

embryogenesis. Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the 
progression of neuroblastoma (Nb) and retinoblastoma (Rb) tumors (Godbout, R. et al. (1998) J. Biol. 
Chem. 273:21 161-21 168). Hiese observations suggest that DDX1 may promote or enhance tumor 
progression by altering the normal secondary structure and expression levels of RNA in cancer cells. 
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Other DEAD-box helicases have been implicated either directly or indirectly in tumorigenesis 
(Discussed in Godbout, supra) . For example, murine p68 is mutated in ultraviolet light-induced 
tumors, and human DDX6 is located at a chromosomal breakpoint associated with B-cell lymphoma. 
Similarly, a chimeric protein comprised of DDX10 and NUP98, a nucleoporin protein, may be 
5 involved in the pathogenesis of certain myeloid malignancies. 
Topoisomerases 

Besides the need to separate DNA strands prior to replication, the two strands must be 
"unwound" from one another prior to their separation by DNA helicases. This function is performed 
by proteins known as DNA topoisomerases. DNA topoisomerase effectively acts as a reversible 

1 o nuclease that hydrolyzes a phosphodiesterase bond in a DNA strand, permitting the two strands to 

rotate freely about one another to remove the strain of the helix, and then rejoins the original 
phosphodiester bond between the two strands. Two types of DNA topoisomerase exist, types I and H. 
DNA Topoisomerase I causes a single-strand break in a DNA helix to allow the rotation of the two 
strands of the helix about the remaining phosphodiester bond in the opposite strand. DNA 

1 5 topoisomerase n causes a transient break in both strands of a DNA helix where two double helices 
cross over one another. This type of topoisomerase can efficiently separate two interlocked DNA 
circles (Alberts, supra, pp.260-262). Type II topoisomerases are largely confined to proliferating cells 
in eukaryotes, such as cancer cells. For this reason they are targets for anticancer drugs. 
Topoisomerase II has been implicated in multi-drug resistance (MDR) as it appears to aid in the repair 5 

20 of DNA damage inflicted by DNA binding agents such as doxorubicin and vincristine. 
Recombinases 

Genetic recombination is the process of rearranging DNA sequences within an organism's 
genome to provide genetic variation for the organism in response to changes in the environment 
DNA recombination allows variation in the particular combination of genes present in an individual' s 

2 5 genome, as well as the timing and level of expression of these genes (see Alberts, supra , pp. 263- 

273). Two broad classes of genetic recombination are commonly recognized, general recombination 
and site-specific recombination. General recombination involves genetic exchange between any 
homologous pair of DNA sequences usually located on two copies of the same chromosome. The 
process is aided by enzymes called recombinases that "nick" one strand of a DNA duplex more or less 

3 o randomly and permit exchange with the complementary strand of another duplex. The process does 

not normally change the arrangement of genes on a chromosome. In site-specific recombination, the 
recombinase recognizes specific nucleotide sequences present in one or both of the recombining 
molecules. Base-pairing is not involved in this form of recombination and therefore does not require 
DNA homology between the recombining molecules. Unlike general recombination, this form of 
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recombination can alter the relative positions of nucleotide sequences in chromosomes. 
Splicing Factors 

Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre- 
mRNA processing steps include capping at the 5* end with methylguanosine, polyadenylating the 3* 
5 end, and splicing to remove introns. The primary RNA transcript from DNA is a faithful copy of the 
gene containing both exon and intron sequences, and the latter sequences must be cut out of the RNA 
transcript to produce an mRNA that codes for a protein. This "splicing" of the mRNA sequence takes 
place in the nucleus with the aid of a large, multicomponent ribonucleoprotein complex known as a 
spliceosome. The spliceosomal complex is composed of five small nuclear ribonucleoprotein 

10 particles (snRNPs) designated Ul , U2, V4, U5, and U6, and a number of additional proteins. Each 
snRNP contains a single species of snRNA and about ten proteins. The RNA components of some 
snRNPs recognize and base pair with intron consensus sequences. The protein components mediate 
spliceosome assembly and the splicing reaction. Autoantibodies to snRNP proteins are found in the 
blood of patients with systemic lupus erythematosus (Stryer, L. (1995) Biochemistry. W.H. Freeman 

15 and Company, New York NY, p. 863). 

Adhesion Molecules 

The surface of a cell is rich in transmembrane proteoglycans, glycoproteins, glycolipids, and 
receptors. These macromolecules mediate adhesion with other cells and with components of the 
2 o extracellular matrix (ECM). The interaction of the cell with its surroundings profoundly influences 
cell shape, strength, flexibility, motility, and adhesion. These dynamic properties are intimately 
associated with signal transduction pathways controlling cell proliferation and differentiation, tissue 
construction, and embryonic development. 
Cadherins 

2 5 Cadherins comprise a family of calcium-dependent glycoproteins that function in mediating 

cell-cell adhesion in virtually all solid tissues of multicellular organisms. These proteins share 
multiple repeats of a cadherin-specific motif, and the repeats form the folding units of the cadherin 
extracellular domain. Cadherin molecules cooperate to form focal contacts, or adhesion plaques, 
between adjacent epithelial cells. The cadherin family includes the classical cadherins and 

30 protocadherins. Classical cadherins include the E-cadherin, N-cadherin, and P-cadherin subfamilies. 
E-cadherin is present on many types of epithelial cells and is especially important for embryonic 
development N-cadherin is present on nerve, muscle, and lens cells and is also critical for embryonic 
development P-cadherin is present on cells of the placenta and epidermis. Recent studies report that 
protocadherins are involved in a variety of cell-cell interactions (Suzuki, S.T. (1996) J. Cell Sci. 
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109:2609-261 1). Hie intracellular anchorage of cadherins is regulated by their dynamic association 
with catenins, a family of cytoplasmic signal transduction proteins associated with the acta 
cytoskeletoa The anchorage of cadherins to the actin cytoskeleton appears to be regulated by protein 
tyrosine phosphorylation, and the cadherins are the target of phosphorylation-induced junctional 
5 disassembly (Aberle, H. et al. (1996) J. Cell. Biochem. 61:5 14-523). 
Integrins 

Integrins are ubiquitous transmembrane adhesion molecules that link the ECM to the internal 
cytoskeleton. Integrins are composed of two noncovalently associated transmembrane glycoprotein 
subunits called a and p. Integrins function as receptors that play a role in signal transduction. For 

l o example, binding of integrin to its extracellular ligand may stimulate changes in intracellular calcium 
levels or protein kinase activity (Sjaastad, M.D. and W.J. Nelson (1997) BioEssays 19:47-55). At 
least ten cell surface receptors of the integrin family recognize the ECM component fibronectin, 
which is involved in many different biological processes including cell migration and embryogenesis 
(Johansson, S. et al. (1997) Front Biosci. 2:D126-D146). 

15 Lectins 

Lectins comprise a ubiquitous family of extracellular glycoproteins which bind cell surface 
carbohydrates specifically and reversibly, resulting in the agglutination of cells (reviewed in 
Drickamer, K. and MJE. Taylor (1993) Annu. Rev. Cell Biol. 9:237-264). This function is 
particularly important for activation of the immune response. Lectins mediate the agglutination and 

20 mitogenic stimulation of lymphocytes at sites of inflammation (Lasky, L.A (1991) J. Cell. Biochem. 
45:139-146; Paietta, E. et al. (1989) J. Immunol. 143:2850-2857). 

Lectins are further classified into subfamilies based on carbohydrate-binding specificity and 
other criteria. The galectin subfamily, in particular, includes lectins that bind P-galactoside 
carbohydrate moieties in a thiol-dependent manner (reviewed in Hadari, Y.R. et al. (1998) J. Biol. 

25 Chem. 270:3447-3453). Galectins are widely expressed and developmentally regulated. Because all 
galectins lack an N-terminal signal peptide, it is suggested that galectins are externalized through an 
atypical secretory mechanism. Two classes of galectins have been defined based on molecular weight 
and oligomerization properties. Small galectins form homodimers and are about 14 to 16 kilodaltons 
in mass, while large galectins are monomelic and about 29-37 kilodaltons. 

3 o Galectins contain a characteristic carbohydrate recognition domain (CRD). The CRD is 

about 140 amino acids and contains several stretches of about 1 - 10 amino acids which are highly 
conserved among all galectins. A particular 6-amino acid motif within the CRD contains conserved 
tryptophan and arginine residues which are critical for carbohydrate binding. Hie CRD of some 
galectins also contains cysteine residues which may be important for disulfide bond formation. 
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Secondary structure predictions indicate that the CRD forms several (J-sheets. 

Galectins play a number of roles in diseases and conditions associated with cell-cell and cell- 
matrix interactions. For example, certain galectins associate with sites of inflammation and bind to 
cell surface immunoglobulin E molecules. In addition, galectins may play an important role in cancer 
5 metastasis. Galectin overexpression is correlated with the metastatic potential of cancers in humans 
and mice. Moreover, anti-galectin antibodies inhibit processes associated with cell transformation, 
such as cell aggregation and anchorage-independent growth (See, for example, Su, Z.-Z. et al. (1996) 
Proc. Natl. Acad. Sci. USA 93:7252-7257). 
Selectins 

1 o Selectins, or LEC-C AMs, comprise a specialized lectin subfamily involved primarily in 

inflammation and leukocyte adhesion (Reviewed in Lasky, supra) . Selectins mediate the recruitment 
of leukocytes from the circulation to sites of acute inflammation and are expressed on the surface of 
vascular endothelial cells in response to cytokine signaling. Selectins bind to specific ligands on the 
leukocyte cell membrane and enable the leukocyte to adhere to and migrate along the endothelial 
is surface. Binding of selectin to its ligand leads to polarized rearrangement of the actin cytoskeleton 
and stimulates signal transduction within the leukocyte (Brenner, B. et al. (1997) Biochem. Biophys. 
Res. Commun. 231:802-807; Hidari, KJL et al. (1997) J. Biol. Chem. 272:28750-28756). Members 
of the selectin family possess three characteristic motifs: a lectin or carbohydrate recognition domain; 
an epidermal growth factor-like domain; and a variable number of short consensus repeats (scr or 

2 o "sushi" repeats) which are also present in complement regulatory proteins. The selectins include 

lymphocyte adhesion molecule- 1 (Lam-1 or L-selectin), endothelial leukocyte adhesion molecule- 1 
(ELAM-1 or E-selectin), and granule membrane protein- 140 (GMP-140 or P-selectin) (Johnston, G.I. 
et al. (1989) Cell 56:1033-1044). 

2 s Antigen Recognition Molecules 

AH vertebrates have developed sophisticated and complex immune systems that provide 
protection from viral, bacterial, fungal, and parasitic infections. A key feature of the immune system 
is its ability to distinguish foreign molecules, or antigens, from "self 1 molecules. This ability is 
mediated primarily by secreted and transmembrane proteins expressed by leukocytes (white blood 

3 o cells) such as lymphocytes, granulocytes, and monocytes. Most of these proteins belong to the 

immunoglobulin (Ig) superfamily, members of which contain one or more repeats of a conserved 
structural domain. This Ig domain is comprised of antiparallel p sheets joined by a disulfide bond in 
an arrangement called the Ig fold. Members of the Ig superfamily include T-cell receptors, major 
histocompatibility (MHC) proteins, antibodies, and immune cell-specific surface markers such as 
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CD4, CD8,andCD28. 

MHC proteins are cell surface markers that bind to and present foreign antigens to T cells. 
MHC molecules are classified as either class I or class IL Class I MHC molecules (MHC I) are 
expressed on the surface of almost all cells and are involved in the presentation of antigen to 
5 cytotoxic T cells. For example, a cell infected with virus will degrade intracellular viral proteins and 
express the protein fragments bound to MHC I molecules on the cell surface. The MHC I/antigen 
complex is recognized by cytotoxic T-cells which destroy the infected cell and the virus within. 
Class n MHC molecules are expressed primarily on specialized antigen-presenting cells of the 
immune system, such as B-cells and macrophages. These cells ingest foreign proteins from the 

1 o extracellular fluid and express MHC IVantigen complex on the cell surface. This complex activates 

helper T-cells, which then secrete cytokines and other factors that stimulate the immune response. 
MHC molecules also play an important role in organ rejection following transplantation. Rejection 
occurs when the recipient's T-cells respond to foreign MHC molecules on the transplanted organ in 
the same way as to self MHC molecules bound to foreign antigen. (Reviewed in Alberts, B. et al. 
15 (1994) Molecular Biology of the Cell . Garland Publishing, New York NY, pp. 1229-1246.) 

Antibodies, or immunoglobulins, are either expressed on the surface of B-cells or secreted by 
B-cells into the circulation. Antibodies bind and neutralize foreign antigens in the blood and other 
extracellular fluids. The prototypical antibody is a tetramer consisting of two identical heavy 
polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 

2 o disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 

Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 8, e, y, and \x H-chain types. There are two types of L- 
chains, k and A, either of which may associate as a pair with any H-chain pair. IgG, the most 
common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 

25 are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C-tenninal constant 
region. The constant region consists of about 1 10 amino acids in L-chains and about 330 or 440 
amino acids in H-chains. Hie amino acid sequence of the constant region is nearly identical among 
H- or L-chains of a particular class. The variable region consists of about 1 1 0 amino acids in both H- 

30 and L-chains. However, the amino acid sequence of the variable region differs among H- or L-chains 
of a particular class. Within each H- or L-chain variable region are three hypervariable regions of 
extensive sequence diversity, each consisting of about 5 to 10 amino acids. In the antibody molecule, 
the H- and L-chain hypervariable regions come together to form the antigen recognition site. 
(Reviewed in Alberts, supra , pp. 1206-1213 and 1216-1217.) 
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Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
contains four Ig domains, three of which occur within the constant region and one of wbich occurs 
within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 
which occurs within the variable region. 

The immune system is capable of recognizing and responding to any foreign molecule that 
enters the body. Therefore, the immune system must be armed with a full repertoire of antibodies 
against all potential antigens. Such antibody diversity is generated by somatic rearrangement of gene 
segments encoding variable and constant regions. These gene segments are joined together by site- 
specific recombination which occurs between highly conserved DNA sequences that flank each gene 
segment. Because there are hundreds of different gene segments, millions of unique genes can be 
generated comhi n atori ally ; In addition, imprecise joining of these segments and an unusually high 
rate of somatic mutation within these segments further contribute to the generation of a diverse 
antibody population. 

T-cell receptors are both structurally and functionally related to antibodies. (Reviewed in 
Alberts, supra , pp. 1228-1229.) T-cell receptors are cell surface proteins that bind foreign antigens and 
mediate diverse aspects of the immune response. A typical T-cell receptor is a heterodimer comprised 
of two disulfide-linked polypeptide chains called a and p. Each chain is about 280 amino acids in 
length and contains one variable region and one constant region Each variable or constant region folds 
into an Ig domain The variable regions from the a and p chains come together in the heterodimer to 
form the antigen recognition site. T-cell receptor diversity is generated by somatic rearrangement of 
gene segments encoding the a and p chains. T-cefl receptors recognize small peptide antigens that are 
expressed on the surface of antigen-presenting cells and pathogen-infected cells. These peptide antigens 
are presented on the cell surface in association with major histocon^atibility proteins which provide the 
proper context for antigen recognition 

Secreted and Extracellular Matrix Molecules 

Protein secretion is essential for cellular function Protein secretion is mediated by a signal 
peptide located at the amino terminus of the protein to be secreted. The signal peptide is comprised of 
about ten to twenty hydrophobic amino acids which target the nascent protein from the ribosome to the 
endoplasmic reticulum (ER). Proteins targeted to the ER may either proceed through the secretory 
pathway or remain in any of the secretory organelles such as theER, Golgi apparatus, or lysosomes. 
Proteins that transit through the secretory pathway are either secreted into the extracellular space or 
retained in the plasma membrane. Secreted proteins are often synthesized as inactive precursors that 
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are activated by post-translational processing events during transit through the secretory pathway. 
Such events include glycosylation, proteolysis, and removal of the signal peptide by a signal peptidase. 
Other events that may occur during protein transport include chaperone-dependent unfolding and 
folding of the nascent protein and interaction of the protein with a receptor or pore complex. Examples 
5 of secreted proteins with amino terminal signal peptides include receptors, extracellular matrix 

molecules, cytokines, hormones, growth and differentiation factors, neuropeptides, vasomediators, ion 
channels, transporters/pumps, and proteases. (Reviewed in Alberts, B. et al. (1994) Molecular Biology 
ofTheCefl . Garland Publishing, New York NY, pp. 557-560, 582-592.) 

The extracellular matrix (ECM) is a complex network of glycoproteins, polysaccharides, 

1 o proteoglycans, and other macromolecules that are secreted from the cell into the extracellular space. 

The ECM remains in close association with the cell surface and provides a supportive meshwork that 
profoundly influences cell shape, motility, strength, flexibility, and adhesion. In fact, adhesion of a 
cell to its surrounding matrix is required for cell survival except in the case of metastatic tumor cells, 
which have overcome the need for cell-ECM anchorage. This phenomenon suggests that the ECM 
15 plays a critical role in the molecular mechanisms of growth control and metastasis. (Reviewed in 
Ruoslahti, R (1996) Sci. Am 275:72-77.) Furthermore, the ECM determines the structure and 
physical properties of connective tissue and is particularly important for morphogenesis and other 
processes associated with embryonic development and patten formation. 

The collagens comprise a family of ECM proteins that provide structure to bone, teeth, skin, 

2 o ligaments, tendons, cartilage, blood vessels, and basement membranes. Multiple collagen proteins have 

been identified Three collagen molecules fold together in a triple helix stabilized by interchain disulfide 
bonds. Bundles of these triple helices then associate to form fibrils. Collagen primary structure 
consists of hundreds of (Gly-X-Y) repeats where about a third of the X and Y residues are Pro. 
Glycines are crucial to helix formation as the bulkier amino acid sidechains cannot fold into the triple 

25 helical conformatioa Because of these strict sequence requirements, mutations in collagen genes have 
severe consequences. Osteogenesis imperfecta patients have brittle bones that fracture easily; in severe 
cases patients die inutero or at birth Eblers-Darilos syndrome patients have hyperelastic skin, 
hypermobile joints, and susceptibility to aortic and intestinal rupture. Chondrodysplasia patients have 
short stature and ocular disorders. Alport syndrome patients have hematuria, sensorineural deafness, 

30 and eye lens deformation. (Isselbacher, KJ. et al. (1994) Harrison's Principles of Internal Medicine . 
McGraw-Hill, Inc., New York NY, pp. 2105-2117; and Creighton, T.E. (1984) Proteins. Structures 
and Molecular Principles. W.H. Freeman and Company, New York NY, pp. 191-197.) 

Elastin and related proteins confer elasticity to tissues such as skin, blood vessels, and lungs. 
Elastin is a highly hydrophobic protein of about 750 amino acids that is rich in proline and glycine 
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residues. Elastin molecules are highly cross-linked, forming an extensive extracellular network of fibers 
and sheets. Elastin fibers are surrounded by a sheath of microfibrils which are composed of a number 
of glycoproteins, including fibrillin. Mutations in the gene encoding fibrillin are responsible for 
Marfan 1 s syndrome, a genetic disorder characterized by defects in connective tissue. In severe cases, 
the aortas of afflicted individuals are prone to rupture. (Reviewed in Alberts, supra , pp. 984-986.) 

Fibronectin is a large ECM glycoprotein found in all vertebrates. Fibronectin exists as a dimer 
of two subunits, each containing about 2,500 amino acids. Each subunit folds into a rod-like structure 
containing multiple domains. The domains each contain multiple repeated modules, the most common 
of which is the type HI fibronectin repeat The type III fibronectin repeat is about 90 amino acids in 
length and is also found in other ECM proteins and in some plasma membrane and cytoplasmic 
proteins. Furthermore, some type JB. fibronectin repeats contain a characteristic tripeptide consisting of 
Arginihe^Giydhe-Aq)artic add (RGD). The ROD sequence is recognized by the infegrih family of cell 
surface receptors and is also found in other ECM proteins. Disruption of both copies of the gene 
encoding fibronectin causes early embryonic lethality in mice. The mutant embryos display extensive 
morphological defects, including defects in the formation of the notochard, somites, heart, blood 
vessels, neural tube, and extraembryonic structures. (Reviewed in Alberts, supra, pp. 986-987.) 

Laminin is a major glycoprotein component of the basal lamina which underlies and supports 
epithelial cell sheets. Laminin is one of the first ECM proteins synthesized in the developing embryo. 
Laminin is an 850 kQodalton protein composed of three polypeptide chains joined in the shape of a 
cross by disulfide bonds. Laminin is especially important for angiogenesis and in particular, for 
guiding the formation of capillaries. (Reviewed in Alberts, supra , pp. 990-99 1 .) 

There are many other types of proteinaceous ECM components, most of which can be 
classified as proteoglycans. Proteoglycans are composed of unbranched polysaccharide chains 
(glycosaminoglycans) attached to protein cores. Common proteoglycans include aggrecan, betaglycan, 
decorin, perlecan, serglycin, and syndecan-1. Some of these molecules not only provide mechanical 
support, but also bind to extracellular signaling molecules, such as fibroblast growth factor and 
transforming growth factor p, suggesting a role for proteoglycans in cell-cell communication and cell 
growth. (Reviewed in Alberts, supra, pp. 973-978.) Likewise, the glycoproteins tenascin-C and 
tenascin-R are expressed in developing and lesioned neural tissue and provide stimulatory and anti- 
adhesive (inhibitory) properties, respectively, for axonal growth. (Faissner, A. (1997) Cell Tissue Res. 
290:331-341.) 

Cytoskeletal Molecules 

The cytoskeleton is a cytoplasmic network of protein fibers that mediate cell shape, structure, 
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and movement The cytoskeleton supports the cell membrane and forms tracks along which 
organelles and other elements move in the cytosol. The cytoskeleton is a dynamic structure that 
allows cells to adopt various shapes and to carry out directed movements. Major cytoskeletal fibers 
include the microtubules, the microfilaments, and the intermediate filaments. Motor proteins, 
5 including myosin, dynein, and kinesin, drive movement of or along the fibers. Hie motor protein 
dynamin drives the formation of membrane vesicles. Accessory or associated proteins modify the 
structure or activity of the fibers while cytoskeletal membrane anchors connect the fibers to the cell 
membrane. 
Tubulins 

1 o Microtubules, cytoskeletal fibers with a diameter of about 24 nm, have multiple roles in the 

cell. Bundles of microtubules form cilia and flagella, which are whip-like extensions of the cell 
membrane that are necessary for sweeping materials across an epithelium and for swimming of 
sperm, respectively. Marginal bands of microtubules in red blood cells and platelets are important for 
these cells' pliability. Organelles, membrane vesicles, and proteins are transported in the cell along 
1 5 tracks of microtubules. For example, microtubules run through nerve cell axons, allowing bi- 
directional transport of materials and membrane vesicles between the cell body and the nerve 
terminal. Failure to supply the nerve terminal with these vesicles blocks the transmission of neural 
signals. Microtubules are also critical to chromosomal movement during cell division. Both stable ' 
and short-lived populations of microtubules exist in the cell. 

2 o Microtubules are polymers of GTP-binding tubulin protein subunits. Each subunit is a 

heterodimer of a- and P- tubulin, multiple isoforms of which exist Hie hydrolysis of GTP is linked 
to the addition of tubulin subunits at the end of a microtubule. The subunits interact head to tail to 
form protofilaments ; the protofilaments interact side to side to form a microtubule. A microtubule is 
polarized, one end ringed with a-tubulin and the other with P-tubulin, and the two ends differ in their 
25 rates of assembly. Generally, each microtubule is composed of 13 protofilaments although 1 1 or 15 
protofilament-microtubules are sometimes found. Cilia and flagella contain doublet microtubules. 
Microtubules grow from specialized structures known as centrosomes or microtubule-organizing 
centers (MTOCs). MTOCs may contain one or two centrioles, which are pinwheel arrays of triplet 
microtubules. The basal body, the organizing center located at the base of a cilium or flagellum, 

3 o contains one centriole. Gamma tubulin present in the MTOC is important for nucleating the 

polymerization of a- and P- tubulin heterodimers but does not polymerize into microtubules. 
Microtubule- Associated Proteins 

Microtubule-associated proteins (MAPs) have roles in the assembly and stabilization of 
microtubules. One major family of MAPs, assembly MAPs, can be identified in neurons as well as 

87 



WO 01/62927 



PCT/US01/06059 



non-neuronal cells. Assembly MAPs are responsible for cross-linking microtubules in the cytosol. 
These MAPs are organized into two domains: a basic microtubule-binding domain and an acidic 
projection domain. The projection domain is the binding site for membranes, intermediate filaments, or 
other microtubules. Based on sequence analysis, assembly MAPs can be further grouped into two 
5 types: Type I and Type II. Type I MAPs, which include MAPI A and MAP1B, are large, filamentous 
molecules that co-purify with microtubules and are abundantly expressed in brain and testes. Type I 
MAPs contain several repeats of a positively-charged amino acid sequence motif that binds and 
neutralizes negatively charged tubulin, leading to stabilization of microtubules. MAPI A and MAP1B 
are each derived from a singje precursor polypeptide that is subsequently proteolytically processed to 

1 o generate one heavy chain and one light chain. 

Another light chain, LC3, is a 16.4 kDa molecule that binds MAPI A, MAP1B, and 
miaotubuies. It is suggested that LC3 is synthesized from a source other than the MAPI A or MAP1B 
transcripts, and that the expression of LC3 may be important in regulating the microtubule binding 
activity of MAPI A and MAP1B during cell proliferation (Mann, S.S. et al. (1994) J. Biol. Chem 
15 269:11492-11497). 

Type II MAPs, which include MAP2a, MAP2b, MAP2c, MAP4, and Tau, are characterized 
by three to four copies of an 1 8-residue sequence in the microtubule-binding domain MAP2a, MAP2b, 
and MAP2c are found only in dendrites, MAP4 is found in non-neuronal cells, and Tau is found in 
axons and dendrites of nerve cells. Alternative splicing of the Tau mRNA leads to the existence of 

2 o multiple forms of Tau protein. Tau phosphorylation is altered in neurodegenerative disorders such as 

Alzheimer's disease, Pick's disease, progressive supranuclear palsy, corticobasal degeneration, and 
familial frontotemporal dementia and Parkinsonism linked to chromosome 17. The altered Tau 
phosphorylation leads to a collapse of the microtubule network and the formation of intraneuronal 
Tau aggregates (Spillantini, M.G. and M. Goedert (1998) TYends Neurosci. 21:428-433). 

2 5 The protein pericentral is found in the MTOC and has a role in microtubule assembly. 

Actins 

Microfilaments, cytoskeletal filaments with a diameter of about 7-9 nm, are vital to cell 
locomotion, cell shape, cell adhesion, cell division, and muscle contraction. Assembly and 
disassembly of the microfilaments allow cells to change their morphology. Microfilaments are the 

3 o polymerized form of actin, the most abundant intracellular protein in the eukaryotic cell. Human cells 

contain six isofonns of actin. The three a-actins are found in different kinds of muscle, nonmuscle P- 
actin and nonmuscle y-actin are found in nonmuscle cells, and another y -actin is found in intestinal 
smooth muscle cells. G-actin, the monomeric form of actin, polymerizes into polarized, helical F- 
actin filaments, accompanied by the hydrolysis of ATP to ADP. Actin filaments associate to form 
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bundles and networks, providing a framework to support the plasma membrane and determine cell 
shape. These bundles and networks are connected to the cell membrane. In muscle cells, thin 
filaments containing actin slide past thick filaments containing the motor protein myosin during 
contraction. A family of actin-related proteins exist that are not part of the actin cytoskeleton, but 
5 rather associate with microtubules and dynein. 
Actin-Associated Proteins 

Actin-associated proteins have roles in cross-linking, severing, and stabilization of actin 
filaments and in sequestering actin monomers. Several of the actin-associated proteins have multiple 
functions. Bundles and networks of actin filaments are held together by actin cross-linking proteins. 

1 o These proteins have two actin-binding sites, one for each filament Short cross-linking proteins 

promote bundle formation while longer, more flexible cross-linking proteins promote network 
formation. Calmodulin-like calcium-binding domains in actin cross-linking proteins allow calcium 
regulation of cross-linking. Group I cross-linking proteins have unique actin-binding domains and 
include the 30 kD protein, EF-la, fascin, and scruin. Group II cross-linking proteins have a 7,000- 
15 MW actin-binding domain and include villin and dematin. Group in cross-linking proteins have 
pairs of a 26,000-MW actin-binding domain and include fimbrin, spectrin, dystrophin, ABP 120, and 
filamin. 

Severing proteins regulate the length of actin filaments by breaking them into short pieces or 
by blocking their ends. Severing proteins include gCAP39, severin (fragmin), gelsolin, and villin. 

2 o Capping proteins can cap the ends of actin filaments, but cannot break filaments. Capping proteins 

include CapZ and tropomodulin. The proteins thymosin and profilin sequester actin monomers in the 
cytosol, allowing a pool of unpolymerized actin to exist Hie actin-associated proteins tropomyosin, 
troponin, and caldesmon regulate muscle contraction in response to calcium. 
Intermediate Fflaments and Associated Proteins 
2 5 Intermediate filaments (IFs) are cytoskdetal fibers with a diameter of about 10 nm, 

intermediate between that of microfilaments and microtubules. IFs serve structural roles in the cell, 
reinforcing cells and organizing cells into tissues. IFs are particularly abundant in epidermal cells and 
in neurons. IFs are extremely stable, and, in contrast to microfilaments and microtubules, do not 
function in cell motility. 

30 Five types of IF proteins are known in mammals. Type I and Type H proteins are the acidic 

and basic keratins, respectively. Heterodimers of the acidic and basic keratins are the building blocks 
of keratin IFs. Keratins are abundant in soft epithelia such as skin and cornea, hard epithelia such as 
nails and hair, and in epithelia that line internal body cavities. Mutations in keratin genes lead to 
epithelial diseases including epidermolysis bullosa simplex, bullous congenital ichthyosiform 
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erythroderma (epidermolytic hyperkeratosis), non-epidermolytic and epidennolytic palmoplantar 
keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and white sponge nevus. Some of 
these diseases result in severe skin blistering. (See, e.g., Wawersik, M et al. (1997) J. Biol. Chem. 
272:32557-32565; and Corden L.D. and W.H. McLean (1996) Exp. Dermatol. 5:297-307.) 
5 Type m DF proteins include desmin, glial fibrillary acidic protean, vimentin, and peripherin. 

Desmin filaments in muscle cells link myofibrils into bundles and stabilize sarcomeres in contracting 
muscle. Glial fibrillary acidic protein filaments are found in the glial cells that surround neurons and 
astrocytes. Vimentin filaments are found in blood vessel endothelial ceils, some epithelial ceils, and 
mesenchymal cells such as fibroblasts, and are commonly associated with microtubules. Vimentin 

1 o filaments may have roles in keeping the nucleus and other organelles in place in the cell. Type IV IFs 

include the neurofilaments and nestia Neurofilaments, composed of three polypeptides NF-L, NF-M, 
and NF-H, are frequently associated with microtubules in axons. Neurofilaments arc responsible for 
the radial growth and diameter of an axon, and ultimately for the speed of nerve inqpulse transmission. 
Changes in phosphorylation and metabolism of neurofilaments are observed in neurodegenerative 
1 5 diseases including amyotrophic lateral sclerosis, Parkinson's disease, and Alzheimer's disease (Julien, 
J.P. and W.E. Mushynski (1998) Prog. Nucleic Acid Res. Mol. Biol. 61:1-23). Type V IFs, the lamins, 
are found in the nucleus where they support the nuclear membrane. 

IFs have a central a-hefccal rod region interrupted by short nonhelical linker segments. The rod 
region is bracketed, in most cases, by non-helical head and tail domains. The rod regions of 

2 o intermediate filament proteins associate to form a coiled-coil dimer . A highly ordered assembly process 

leads from the dimers to the IFs. Neither ATP nor GTP is needed for IF assembly, unlike that of 
microfilaments and microtubules. 

EF-associated proteins (IFAPs) mediate the interactions of IFs with one another and with other 
cell structures. IFAPs cross-link IFs into a bundle, into a network, or to the plasma membrane, and 

2 5 may cross-link IFs to the microfilament and microtubule cytosMetoa Microtubules and IFs are in 

particular closely associated IFAPs include BPAG1, plakoglobin, desmoplakin I, desmoplakin n, 
plectin, ankyrin, filaggrin, and lamin B receptor. 
Cvtoskeletal-Membrane Anchors 

Cytoskeletal fibers are attached to the plasma membrane by specific proteins. These 

3 o attachments are important for maintaining cell shape and for muscle contraction. In erythrocytes, the 

spectrin-actin cytoskeleton is attached to cell membrane by three proteins, band 4.1, ankyrin, and 
adducin. Defects in this attachment result in abnormally shaped cells which are more rapidly 
degraded by the spleen, leading to anemia. In platelets, the spectrin-actin cytoskeleton is also linked 
to the membrane by ankyrin; a second actin network is anchored to the membrane by filamin. In 
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muscle cells the protein dystrophin links actin filaments to the plasma membrane; mutations in the 
dystrophin gene lead to Duchenne muscular dystrophy. In adherens junctions and adhesion plaques 
the peripheral membrane proteins a-actinin and vinculin attach actin filaments to the cell membrane. 
IFs are also attached to membranes by cytoskeletal-membrane anchors. The nuclear lamina is 
5 attached to the inner surface of the nuclear membrane by the lamin B receptor. Vimentin IFs are 
attached to the plasma membrane by ankyrin and plectin. Desmosome and hemidesmosome 
membrane junctions hold together epithelial cells of organs and skin. These membrane junctions 
allow shear forces to be distributed across the entire epithelial cell layer, thus providing strength and 
rigidity to the epithelium. IFs in epithelial cells are attached to the desmosome by plakoglobin and 

10 desmoplakins. The proteins that link IFs to hemidesmosomes are not known. Desmin IFs surround 
the sarcomere in muscle and are linked to the plasma membrane by paranemin, synemin, and ankyrin. 
Mvosin-related Motor Proteins 

Myosins are actin-activated ATPases, found in eukaryotic cells, that couple hydrolysis of 
ATP with motion. Myosin provides the motor function for muscle contraction and intracellular 

is movements such as phagocytosis and rearrangement of cell contents during mitotic cell division 
(cytokinesis). The contractile unit of skeletal muscle, termed the sarcomere, consists of highly ordered 
arrays of thin actin-containing filaments and thick myosin-containing filaments. Crossbridges form 
between the thick and thin filaments, and the ATP-dependent movement of myosin heads within the - 
thick filaments pulls the thin filaments, shortening the sarcomere and thus the muscle fiber. 

2 o Myosins are composed of one or two heavy chains and associated light chains. Myosin 

heavy chains contain an ammo-terminal motor or head domain, a neck that is the site of light-chain 
binding, and a carboxy-terminal tail domain. The tail domains may associate to form an a-helical 
coiled coil. Conventional myosins, such as those found in muscle tissue, are composed of two 
myosin heavy-chain subunits, each associated with two light-chain subunits that bind at the neck 
25 region and play a regulatory role. Unconventional myosins, believed to function in intracellular 

motion, may contain either one or two heavy chains and associated light chains. There is evidence for 
about 25 myosin heavy chain genes in vertebrates, more than half of them unconventional. 
Dvnein-related Motor Proteins 

Dyneins are (-) aid-directed motor proteins which act on microtubules. Two classes of 

3 o dyneins, cytosolic and axonemal, have been identified. Cytosolic dyneins are responsible for 

translocation of materials along cytoplasmic microtubules, for example, transport from the nerve 
terminal to the cell body and transport of endocytic vesicles to lysosomes. Cytoplasmic dyneins are 
also reported to play a role in mitosis. Axonemal dyneins are responsible for the beating of flageUa and 
cilia. Dynein on one microtubule doublet walks along the adjacait microtubule doublet This sliding 
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force produces bending forces that cause the flageUum or cilium to beat Dyneans have a native mass 
between 1000 and 2000 kDa and contain either two or three force-producing heads driven by the 
hydrolysis of ATP. The heads are linked via stalks to a basal domain which is composed of a highly 
variable number of accessory intermediate and light chains. 
5 Kinesin-related Motor Proteins 

Kinesins are (+) end-directed motor proteins which act on microtubules. The prototypical 
kinesin molecule is involved in the transport of membrane-bound vesicles and organelles. This function 
is particularly important for axonal transport in neurons. Kinesin is also important in all cell types for 
the transport of vesicles from the Golgi complex to the endoplasmic reticulum This role is critical for 

10 maintaining the identity and functionality of these secretory organelles. 

Kinesins define a ubiquitous, conserved family of ova: 50 protons that can be classified into at 
least 8 subfamilies based on primary amino acid sequence, domain structure, velocity of movement, and 
cellular function. (Reviewed in Moore, J.D. and S A Endow (1996) Bioessays 18:207-219; and Hoyt, 
AM. (1994) Curr. Opin. Cell Biol. 6:63-68.) The prototypical kinesin molecule is a heterotetramer 

15 comprised of two heavy polypeptide chains (KHCs) and two light polypeptide chains (KLCs). The 
KHC subunits are typically referred to as "kinesin. " KHC is about 1000 amino acids in length, and 
KLC is about 550 amino acids in length. Two KHCs dimerize to form a rod-shaped molecule with 
three distinct regions of secondary structure. At one end of the molecule is a globular motor domain 
that functions in ATP hydrolysis and microtubule binding. Kinesin motor domains are highly conserved 

20 and share over 70% identity. Beyond the motor domain is an a-hdical coiled-coil region which 
mediates dimerization. At the other end of the molecule is a fan-shaped tail that associates with 
molecular cargo. The tail is formed by the interaction of the KHC C -termini with the two KLCs. 

Members of the more divergent subfamilies of kinesins are called kinesin-rdated proteins 
(KRPs), many of which function during mitosis in eukaryotes (Hoyt, supra). Some KRPs are required 

2 s for assembly of the mitotic spindle. In vivo and in vitro analyses suggest that these KRPs exert force 
on microtubules that comprise the mitotic spindle, resulting in the separation of spindle poles. 

r" 

Phosphorylation of KRP is required for this activity. Failure to assemble the mitotic spindle results in 
abortive mitosis and chromosomal aneuploidy, the latter condition being characteristic of cancer cells. 
In addition, a unique KRP, centromere protein E, localizes to the kinetochore of human mitotic 
so chromosomes and may play a role in their segregation to opposite spindle poles. 
Dvnamin-related Motor Proteins 

Dynamin is a large GTPase motor protein that functions as a "molecular pinchase," 
generating a mechanochemical force used to sever membranes. This activity is important in forming 
clathrin-coated vesicles from coated pits in endocytosis and in the biogenesis of synaptic vesicles in 
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neurons. Binding of dynamin to a membrane leads to dynamin' s self-assembly into spirals that may 
act to constrict a flat membrane surface into a tubule. GTP hydrolysis induces a change in 
conformation of the dynamin polymer that pinches the membrane tubule, leading to severing of the 

membrane tubule and formation of a membrane vesicle. Release of GDP and inorganic phosphate 

j 

5 leads to dynamin disassembly. Following disassembly the dynamin may either dissociate from the 
membrane or remain associated to the vesicle and be transported to another region of the cell. Three 
homologous dynamin genes have been discovered, in addition to several dynamin-related proteins. 
Conserved dynamin regions are the N-terminal GTP-binding domain, a central pleckstrin homology 
domain that binds membranes, a central coiled-coil region that may activate dynamin's GTPase 

1 o activity, and a C-terminal proline-rich domain that contains several motifs that bind SH3 domains on 

other proteins. Some dynamin-related proteins do not contain the pleckstrin homology domain or the 
proline-rich domain. (See McNiven, MA, (1998) Cell 94:151-154; Scaife, R.M. and R.L. Margolis 
(1997) Cell. Signal. 9:395-401.) 

The cytoskeleton is reviewed in Lodish, H. et al. (1995) Molecular Cell Biology. Scientific 
1 5 American Books, New York NY. 

Ribosomal Molecules 

Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into ribosomes, 
which are cytoplasmic particles that translate messenger RNA into polypeptides. The eukaryotic - 

2 o ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, which together form the 

80S ribosome. In addition to the 18S, 28S, 5S, and 5 .8S rRNAs, the ribosome also contains more 
than fifty proteins. Hie ribosomal proteins have a prefix which denotes the subunit to which they 
belong, either L (large) or S (small). Ribosomal protein activities include binding rRNA and 
organizing the conformation of the junctions between rRNA helices (Woodson, S.A. and N.B. 
25 Leontis (1998) Cmr. Opin. Struct Biol. 8:294-300; Ramakrishnan, V. and S.W. White (1998) Trends 
Biochem. Sci. 23:208-212.) Three important sites are identified on the ribosome. The aminoacyl- 
tRNA site (A site) is whore charged tRNAs (with the exception of the inrtiator-tRNA) bind on arrival 
at the ribosome. The peptidyl-tRNA site (P site) is where new peptide bonds are formed, as well as 
where the initiator tRNA binds. The exit site (E site) is where deacylated tRNAs bind prior to their 

3 o release from the ribosome. (The ribosome is reviewed in Stryer, L. (1 995) Biochemistry W.H. 

Freeman and Company, New York NY, pp. 888-908; and Lodish, H. et al. (1995) Molecular Cell 
Biology Scientific American Books, New York NY. pp. 1 19-138.) 

Chromatin Molecules 
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The nuclear DNA of eukaryotes is organized into chromatin. Two types of chromatin are 
observed: euchromatin, some of which may be transcribed, and heterochromatin so densely packed that 
much of it is inaccessible to transcriptioa Chromatin packing thus serves to regulate protein 
expression in eukaryotes. Bacteria lack chromatin and the chromatin-packing level of gene regulation. 
5 The fundamental unit of chromatin is the nucleosome of 200 DNA base pairs associated with 

two copies each of histones H2A, H2B, H3, and H4. Adjascent nucleosomes are linked by another 
class of histones, HI . Low molecular weight non-histone proteins called the high mobility group 
(HMG) t associated with chromatin, may function in the unwinding of DNA and stabilization of single- 
stranded DNA Cbromodomain proteins function in compaction of chromatin into its transcriptionally 

10 silent heterochromatin form 

During mitosis, all DNA is compacted into heterochromatin and transcription ceases. 
Transcription in interphase begins with the activation of a region of chromatin. Active chromatin is 
decondensed Decondensation appears to be accompanied by changes in binding coefficient, 
phosphorylation and acetylation states of chromatin histones. HMG proteins HMG13 and HMG17 

15 selectively bind activated chromatin. Topoisomerases remove superhelical tension on DNA The 
activated region decondenses, allowing gene regulatory proteins and transcription factors to assemble 
on the DNA 

Patterns of chromatin structure can be stably inherited, producing heritable patterns of gene 
expressioa In mammals, one of the two X chromosomes in each female cell is inactivated by 

2 o condensation to heterochromatin during zygote development The inactive state of this chromosome is 

inherited, so that adult females are mosaics of clusters of paternal-X and maternal-X clonal ceil groups. 
The condensed X chromosome is reactivated in meiosis. 

Chromatin is associated with disorders of protein expression such as thalassemia, a genetic 
anemia resulting from the removal of the locus control region (LCR) required for decondensation of the 
25 gjobin gene locus. 

For a review of chromatin structure and function see Alberts, B. et al. (1994) Molecular Cell 
Biology, third edition, Garland Publishing, Inc., New York NY, pp. 351-354, 433-439. 

Electron Transfer Associated Molecules 

3 o Electron carriers such as cytochromes accept electrons from N ADH or FADHj and donate 

them to other electron carriers. Most electron-transferring proteins, except ubiquinone, are prosthetic 
groups such as flavins, heme, FeS clusters, and copper, bound to inner membrane proteins. 
Adrenodoxin, for example, is an FeS protein that forms a complex with NADPH:adrenodoxin 
reductase and cytochrome p450. Cytochromes contain a heme prosthetic group, a porphyrin ring 
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containing a tightly bound iron atom. Electron transfer reactions play a crucial role in cellular energy 
production. 

Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 
to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 
5 complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH and FADH^ 
to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
10 aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA. is oxidized to C0 2 with concomitant formation of -NADH, FADH 2 , and 
GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and P t by the F^ ATPase complex in 
is the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the FJF X ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. 

ATP synthesis requires membrane transport enzymes including the phosphate transporter and 
the ATP- ADP antiport protein. The ATP-binding casette (ABC) supeifamily has also been suggested 
20 as belonging to the mitochondrial transport group (Hogue, DJL. et al. (1999) J. Mol. Biol. 285:379- 
389). Brown fat uncoupling protein dissipates oxidative energy as heat, and may be involved the fever 
response to infection and trauma (Cannon, B. et al. (1998) Ann. NY Acad. Sci. 856:171-187). 

Mitochondria are oval-shaped organelles comprising an outer membrane, a tightly folded 
inner membrane, an intermembrane space between the outer and inner membranes, and a matrix 

2 5 inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 

charged molecules to enter the intermembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 

Mitochondria contain a small amount of DN A. Human mitochondrial DNA encodes 1 3 

3 o proteins, 22 tRNAs, and 2 rRNAs. Mitochondrial-DN A encoded proteins include NADH-Q 

reductase, a cytochrome reductase subunit, cytochrome oxidase subunits, and ATP synthase subunits. 

Electron-transfer reactions also occur outside the mitochondria in locations such as the 
endoplasmic reticulum, which plays a crucial role in lipid and protein biosynthesis. Cytochrome b5 
is a central electron donor for various reductive reactions occurring on the cytoplasmic surface of liver 
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endoplasmic reticulum. Cytochrome b5 has been found in Golgi, plasma, endoplasmic reticulum 
(ER), and microbody membranes. 

For a review of mitochondrial metabolism and regulation, see Lodish, H. et al. (1995) 
Molecular Cell Biology, Scientific American Books, New York NY, pp. 745-797 and Stryer (1995) 
Biochemistry, W.H. Freeman and Co., San Francisco CA, pp 529-558, 988-989. 

Hie majority of mitochondrial proteins are encoded by nuclear genes, are synthesized on 
cytosolic ribosomes, and are imported into the mitochondria. Nuclear-encoded proteins which are 
destined for the mitochondrial matrix typically contain positively-charged amino terminal signal 
sequences. Import of these preproteins from the cytoplasm requires a multisubunit protein complex 
in the outer membrane known as the translocase of outer mitochondrial membrane (TOM; previously 
designated MOM; Pfanner, N. et al. (1996) Trends Biochem. Sci. 21:51-52) and at least three inner 
membrane proteins which comprise the translocase of inner mitochondrial membrane (TIM; 
previously designated MIM; Pfanner, supra) . An inside-negative membrane potential across the inner 
mitochondrial membrane is also required for preprotein import Preproteins are recognized by surface 
receptor components of the TOM complex and are translocated through a proteinaceous pore formed 
by other TOM components. Proteins targeted to the matrix are then recognized by the import 
machinery of the TIM complex. The import systems of the outer and inner membranes can function 
independently (Segui-Real, B. et al. (1993) EMBO J. 12:2211-2218). 

Once precursor proteins are in the mitochondria, the leader peptide is cleaved by a signal 
peptidase to generate the mature protein. Most leader peptides are removed in a one step process by a 
protease termed mitochondrial processing peptidase (MPP) (Paces, V. et al. (1993) Proc. Natl. Acad. 
Sci. USA 90:5355-5358). In some cases a two-step process occurs in which MPP generates an 
intermediate precursor form which is cleaved by a second enzyme, mitochondrial intermediate 
peptidase, to generate the mature protein. 

Mitochondrial dysfunction leads to impaired calcium buffering, generation of free radicals 
that may participate in deleterious intracellular and extracellular processes, changes in mitochondrial 
permeability and oxidative damage which is observed in several neurodegenerative diseases. 
Neurodegenerative diseases linked to mitochondrial dysfunction include some forms of Alzheimer's 
disease, Friedreich's ataxia, familial amyotrophic lateral sclerosis, and Huntington's disease (Beal, 
MP. (1998) Biochim. Biophys. Acta 1366:211-213). The myocardium is heavily dependent on 
oxidative metabolism, so mitochondrial dysfunction often leads to heart disease (DiMauro, S. and M. 
Hirano (1998) Curr. Opia Cardiol 13: 190-197). Mitochondria are implicated in disorders of cell 
proliferation, since they play an important role in a cell' s decision to proliferate or self-destruct 
through apoptosis. The oncoprotein Bcl-2, for example, promotes cell proliferation by stabilizing 
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mitochondrial membranes so that apoptosis signals are not released (Susin, S.A. (1998) Biochim. 
Biophys. Acta 1366:151-165). 

Transcription Factor Molecules 
5 Multicellular organisms are comprised of diverse cell types that differ dramatically both in 

structure and function. The identity of a cell is determined by its characteristic pattern of gene 
expression, and different cell types express overlapping but distinctive sets of genes throughout 
development Spatial and temporal regulation of gene expression is critical for the control of cell 
proliferation, cell differentiation, apoptosis, and other processes that contribute to organismal 

1 o development Furthermore, gene expression is regulated in response to extracellular signals that 

mediate cell-cell communication and coordinate the activities of different cell types. Appropriate 
gene regulation also ensures that cells function efficiently by expressing only those genes whose 
functions are required at a given time. 

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
1 5 these proteins function as transcription factors that initiate, activate, repress, or terminate gene 

transcription. Transcription factors generally bind to the promoter, enhancer, and upstream regulatory 
regions of a gene in a sequence-specific manner, although some factors bind regulatory elements 
within or downstream of a gene's coding region. Transcription factors may bind to a specific region 
of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) Genes 

2 o IV, Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 

The double helix structure and repeated sequences of DNA create topological and chemical 
features which can be recognized by transcription factors. These features are hydrogen bond donor 
and acceptor groups, hydrophobic patches, major and minor grooves, and regular, repeated stretches 
of sequence which induce distinct bends in the helix. Typically, transcription factors recognize ' 

2 5 specific DNA sequence motifs of about 20 nucleotides in length. Multiple, adjacent transcription 

factor-binding motifs may be required for gene regulation. 

Many transcription factors incorporate DNA-binding structural motifs which comprise either 
a helices or 6 sheets that bind to the major groove of DNA. Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 

3 o motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

The helix-turn-helix motif consists of two a helices connected at a fixed angle by a short 
chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are 
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins are 
critical for specifying the anterior-posterior body axis during development and are conserved 
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throughout the animal kingdom. The Antennapedia and Ultrabithorax proteins of Drosophila 
melanogaster are prototypical homeodomain proteins (Pabo, CO. and R.T. Sauer (1992) Annu. Rev. 
Biochem. 61:1053-1095). 

. The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 30 
5 amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 

sequence pattern, designated C2H2 and C3HC4 ("RING" finger), have been described (Lewin, supra) . 
Zinc finger proteins each contain an a helix and an antiparaUel B sheet whose proximity and 
conformation are maintained by the zinc ion. Contact with DNA is made by the arginine prece ding 
the a helix and by the second, third, and sixth residues of the a helix. Variants of the zinc finger 

1 o motif include poorly defined cy steine-rich motifs which bind zinc or other metal ions. These motifs 

may not contain histidine residues and are generally nonrepetitive. 

Hie leucine zipper motif comprises a stretch of amino acids rich in leucine which can form an 
amphipathic a helix. This structure provides the basis for dimerization of two leucine zipper 
proteins. The region adjacent to the leucine zipper is usually basic, and upon protein dimerization, is 
15 optimally positioned for binding to the major groove. Proteins containing such motifs are generally 
referred to as bZIP transcription factors. 

The helix-loop-helix motif (HLH) consists of a short a helix connected by a loop to a longer 
a helix. The loop is flexible and allows the two helices to fold back against each other and to bind to 
DNA. The transcription factor Myc contains a prototypical HLH motif. 

2 o Most transcription factors contain characteristic DNA binding motifs, and variations on the 

above motifs and new motifs have been and are currently being characterized (Faisst, S. and S. Meyer 
(1992) Nucleic Acids Res. 20:3-26). 

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 
25 insufficient expression of tumor suppressor genes (Cleary, MJL. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 
inappropriate gene transcription, potentially contributing to malignancy. 

In addition, the immune system responds to infection or trauma by activating a cascade of 

3 o events that coordinate the progressive selection, amplification, and mobilization of cellular defense 

mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is well 
documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 
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infections (Isselbacher, KJ. et al. (1996) Harrison's Principles of Internal Medicine. 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

Furthermore, the generation of multicellular organisms is based upon the induction and 
coordination of cell differentiation at the appropriate stages of development Central to this process is 
5 differential gene expression, which confers the distinct identities of cells and tissues throughout the 
body. Failure to regulate gene expression during development can result in developmental disorders. 
Human developmental disorders caused by mutations in zinc finger-type transcriptional regulators 
include: urogenenital developmental abnormalities associated with WT1; Greig 
cephalopolysyndactyly, Pallister-Hall syndrome, and postaxial Polydactyly type A (GLI3); and 
lo Townes-Brocks syndrome, characterized by anal, renal, limb, and ear abnormalities (SALL1) 

(Engelkamp, D. and V. van Heyningen (1996) Cuir, Opin. Genet Dev. 6:334-342; Kohlhase, J. et al. 
(1999) Am. J. Hum. Genet. 64:435-445). 

Cell Membrane Molecules 

15 Eukaryotic cells are surrounded by plasma membranes which enclose the cell and maintain an 

environment inside the cell that is distinct from its surroundings. In addition, eukaryotic organisms 
are distinct from prokaryotes in possessing many intracellular organelle and vesicle structures. Many 
of the metabolic reactions which distinguish eukaryotic biochemistry from prokaryotic biochemistry 
take place within these structures. Hie plasma membrane and the membranes surrounding organelles 

20 and vesicles are composed of phosphoglycerides, fatty acids, cholesterol, phospholipids, glycolipids, 
proteoglycans, and proteins. These components confer identity and functionality to the membranes 
with which they associate. 
Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 

25 are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
typically comprised of 15 to 25 hydrophobic amino adds which are predicted to adopt an a-helical 
conformation. TM proteins are classified as bitopic (Types I and II) and polytopic (Types III and IV) 
(Singer, SJ. (1990) Annu. Rev. Cell Biol. 6:247-296). Bitopic proteins span the membrane once 
while polytopic proteins contain multiple membrane-spanning segments. TM proteins function as 

3 o cell-surface receptors, receptor-interacting proteins, transporters of ions or metabolites, ion channels, 
cell anchoring proteins, and cell type-specific surface antigens. 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, 
and GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. RGD, 
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NGR, and GSL motif-containing peptides have been used as drag delivery agents in targeted cancer 
treatment of tumor vasculature (Arap, W. et al. (1998) Science 279:377-380). Furthermore, MPs may 
also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), that 
mediate interactions with extracellular or intracellular molecules. 
G-Protein Coupled Receptors 

G-protein coupled receptors (GPCR) are a superfamily of integral membrane proteins which 
transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators of 
inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
N-terminus, and a cytoplasmic C -terminus. Three extracellular loops alternate with three intracellular 
loops to link the seven transmembrane regions. Cysteine disulfide bridges connect the second and 
third extracellular loops. The most conserved regions of GPCRs are the transmembrane regions and 
the first two cytoplasmic loops. A conserved, acidic-Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, S. 
and S. Arkinstall (1994) The G-protein Linked Receptor Facts Book . Academic Press, San Diego CA, 
pp. 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 
associated with neurological disorders such as schizophrenia, Parkinson's disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 
Scavenger Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding 
of low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are 
trimeric membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
transmembrane domain, a large extracellular domain, and a C-terminal cysteine-rich domain. Hie 
extracellular domain contains a short spacer region, an a-helical coiled-coil region, and a triple helical 
collagen-like region. These receptors have been shown to bind a spectrum of ligands, including 
chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, 
and asbestos (Matsumoto, A. et al. (1990) Proc. Natl. Acad. Sci. USA 87:9133-9137; andElomaa, 0. 
et al. (1995) Cell 80:603-609). The scavenger receptors are thought to play a key role in 
atherogenesis by mediating uptake of modified LDL in arterial walls, and in host defense by binding 
bacterial endotoxins, bacteria, and protozoa. 
Tetraspan Family Proteins 

The transmembrane 4 superfamily (TM4SF) or tetraspan family is a multigene family 
encoding type in integral membrane proteins (Wright, M.D. and M.G. Tomlinson (1994) Immunol. 
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Today 15:588-594). The TM4SF is comprised of membrane proteins which traverse the cell 
membrane four times. Members of the TM4SF include platelet and endothelial cell membrane 
proteins, melanoma-associated antigens, leukocyte surface glycoproteins, colonal carcinoma antigens, 
tumor-associated antigens, and surface proteins of the schistosome parasites (Jankowski, S.A. (1994) 
5 Oncogene 9:1205-121 1). Members of the TM4SF share about 25-30% amino acid sequence identity 
with one another. 

A number of TM4SF members have been implicated in signal transduction, control of cell 
adhesion, regulation of cell growth and proliferation, including development and oncogenesis, and 
cell motility, including tumor cell metastasis. Expression of TM4SF proteins is associated with a 

1 o variety of tumors and the level of expression may be altered when cells are growing or activated. 

Tumor Antigens 

Tumor antigens are eeH surface molecules that are differentially expressed in tumor ceils 
relative to normal cells. Tumor antigens distinguish tumor cells immunologically from normal cells 
and provide diagnostic and therapeutic targets for human cancers (Takagi, S. et al. (1995) Int J. 
15 Cancer 61:706-715; Liu, E. et al. (1992) Oncogene 7:1027-1032). 
Leukocyte Antigens 

Other types ? of cell surface antigens include those identified on leukocytic cells of the immune 
system. These antigens have been identified using systematic, monoclonal antibody (mAb)-based 
"shot gun" techniques. These techniques have resulted in the production of hundreds of mAbs 

2 o directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 

"clusters of differentiation" based on common immunocytochemical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "cluster of differentiation" or "CD" 
designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 

2 5 verified by standard molecular biology techniques. CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GPI). 
(Reviewed in Barclay, A.N. et al. (1995) The Leucocyte Antigen Facts Book . Academic Press, San 
Diego CA,pp. 17-20.) 

30 Tnn Channels 

Ion channels are found in the plasma membranes of virtually every cell in the body. For 
example, chloride channels mediate a variety of cellular functions including regulation of membrane 
potentials and absorption and secretion of ions across epithelial membranes. Chloride channels also 
regulate the pH of organelles such as the Golgi apparatus and endosomes (see, e.g., Greger, R. (1988) 
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Annu. Rev. Physiol. 50:111-122). Electrophysiological and pharmacological properties of chloride 
channels, including ion conductance, current-voltage relationships, and sensitivity to modulators, 
suggest that different chloride channels exist in muscles, neurons, fibroblasts, epithelial cells, and 
lymphocytes. 

5 Many ion channels have sites for phosphorylation by one or more protein kinases including 

protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of which regulate ion 
channel activity in cells. Inappropriate phosphorylation of proteins in cells has been linked to 
changes in cell cycle progression and cell differentiation. Changes in the cell cycle have been linked 
to induction of apoptosis or cancer. Changes in cell differentiation have been linked to diseases and 
10 disorders of the reproductive system, immune system, skeletal muscle, and other organ systems. 
Proton Pumps 

Proton ATPases comprise a large class of membrane proteins that use the energy of ATP 
hydrolysis to generate an electrochemical proton gradient across a membrane. The resultant gradient 
may be used to transport other ions across the membrane (Na\ K + , or CI") or to maintain organelle 

15 pH. Proton ATPases are further subdivided into the mitochondrial F-ATPases, the plasma membrane 
ATPases, and the vacuolar ATPases. The vacuolar ATPases establish and maintain an acidic pH 
within various organelles involved in the processes of endocytosis and exocytosis (Mellman, I. et al. 
(1986) Annu. Rev. Biochem. 55:663-700). 

Proton-coupled, 12 membrane-spanning domain transporters such as PEPT 1 and PEPT 2 are 

2 o , responsible for gastrointestinal absorption and for renal reabsorption of peptides using an 

electrochemical H* gradient as the driving force. Another type of peptide transporter, the TAP 
transporter, is a heterodimer consisting of TAP 1 and TAP 2 and is associated with antigen 
processing. Peptide antigens are transported across the membrane of the endoplasmic reticulum by 
TAP so they can be expressed on the cell surface in association with MHC molecules. Each TAP 

2 5 protein consists of multiple hydrophobic membrane spanning segments and a highly conserved. 
ATP-binding cassette (Boll, M. et al. (1996) Proc. Natl. Acad. Sci. USA 93:284-289). Pathogenic 
microorganisms, such as herpes simplex virus, may encode inhibitors of TAP-mediated peptide 
transport in order to evade immune surveillance (Marusina, K. and J. J Manaco (1996) Curr. Opin. 
Hematol. 3:19-26). 

30 ABC Transporters 

The ATP-binding cassette (ABC) transporters, also called the "traffic ATPases", comprise a 
superf amily of membrane proteins that mediate transport and channel functions in prokaryotes and 
eukaryotes (Higgins, C.F. (1992) Annu. Rev. Cell Biol. 8:67-113). ABC proteins share a similar 
overall structure and significant sequence homology. All ABC proteins contain a conserved domain 
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of approximately two hundred amino acid residues which includes one or more nucleotide binding 
domains. Mutations in ABC transporter genes are associated with various disorders, such as 
hyperbilirubinemia n/Dubin- Johnson syndrome, recessive Stargardt's disease, X-linked 
adrenoleukodystrophy, multidrug resistance, celiac disease, and cystic fibrosis. 
5 Peripheral and Anchored Membrane Proteins 

Some membrane proteins are not membrane-spanning but are attached to the plasma 
membrane via membrane anchors or interactions with integral membrane proteins. Membrane 
anchors are covalently joined to a protein post-translationally and include such moieties as prenyl, 
myristyl, and glycosylphosphatidyl inositol groups. Membrane localization of peripheral and 

1 o anchored proteins is important for their function in processes such as receptor-mediated signal 

transduction. For example, prenylation of Ras is required for its localization to the plasma membrane 
and for its normal and oncogenic functions in signal transduction. 
Vesicle Coat Proteins 

Intercellular communication is essential for the development and survival of multicellular 
15 organisms. Cells communicate with one another through the secretion and uptake of protein 

signaling molecules. The uptake of proteins into the cell is achieved by the endocytic pathway, in 
which the interaction of extracellular signaling molecules with plasma membrane receptors results in 
the formation of plasma membrane-derived vesicles that enclose and transport the molecules into the • 
cytosol. These transport vesicles fuse with and mature into endosomal and lysosomal (digestive) 

2 o compartments. The secretion of proteins from the cell is achieved by exocytosis, in which molecules 

inside of the cell proceed through the secretory pathway. In this pathway, molecules transit from the 
ER to the Golgi apparatus and finally to the plasma membrane, where they are secreted from the cell. 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 

2 5 (tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN)> the plasma membrane 

(PM), and tubular extensions of the endosomes. Vesicle formation occurs when a region of 
membrane buds off from the donor organelle. The membrane-bound vesicle contains proteins to be 
transported and is surrounded by a proteinaceous coat, the components' of which are recruited from 
the cytosol. Two different classes of coat protein have been identified. Clathrin coats form on 

3 o vesicles derived from the TGN and PM, whereas coatomer (COP) coats form on vesicles derived 

from the ER and Golgi. COP coats can be further classified as COPI, involved in retrograde traffic 
through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde traffic from the 
ER to the Golgi (Mellman, supra) . 

In clathiin-based vesicle formation, adapter proteins bring vesicle cargo and coat proteins 
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together at the surface of the budding membrane. Adapter protein- 1 and -2 select cargo from the 
TGN and plasma membrane, respectively, based on molecular information encoded on the 
cytoplasmic tail of integral membrane cargo proteins. Adapter proteins also recruit clathrin to the bud 
site. Clathrin is a protein complex consisting of three large and three small polypeptide chains 
5 arranged in a three-legged structure called a triskelion. Multiple triskelions and other coat proteins 
appear to self-assemble on the membrane to form a coated pit This assembly process may serve to 
deform the membrane into a budding vesicle. GTP-bound ADP-ribosylation factor (Art) is also 
incorporated into the coated assembly. Another small G-protein, dynamin, forms a ring complex 
around the neck of the forming vesicle and may provide the mechanochemical force to seal the bud, 

10 thereby releasing the vesicle. Hie coated vesicle complex is then transported through the cytosol. 
During the transport process, Arf-bound GTP is hydrolyzed to GDP, and the coat dissociates from the 
transport vesicle (West, M.A. et al. (1997) J. Cell Biol. 138:1239-1254). 

Vesicles which bud from the ER and the Golgi are covered with a protein coat similar to the 
clathrin coat of endocytic and TGN vesicles. The coat protein (COP) is assembled from cytosolic 

15 precursor molecules at specific budding regions on the organelle. The COP coat consists of two 

major components, a G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar 
complex of seven proteins, tamed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. The 
coatomer complex binds to dilysine motifs contained on the cytoplasmic tails of integral membrane 
proteins. These include the KKXX retrieval motif of membrane proteins of the ER and 

2 o dibasic/diphenylamine motifs of members of the p24 family. The p24 family of type I membrane 
proteins represent the major membrane proteins of COPI vesicles (Harter, C. and F.T. Wieland (1998) 
Proc. Natl. Acad. Sci. USA 95:11649-11654). 

Organelle Associated Molecules 

2 5 Eukaryotic cells are organized into various cellular organdies which has the effect of 

separating specific molecules and their functions from one another and from the cytosol. Within the 
cell, various membrane structures surround and define these organelles while allowing them to 
interact with one another and the cell environment through both active and passive transport 
processes. Important cell organelles include the nucleus, the Golgi apparatus, the endoplasmic 

3 o reticulum, mitochondria, peroxisomes, lysosomes, endosomes, and secretory vesicles. 
Nucleus 

The cell nucleus contains all of the genetic information of the cell in the form of DNA, and 
the components and machinery necessary for replication of DNA and for transcription of DNA into 
RNA. (See Alberts, B, et al. (1994) Molecular Biology of the Cell . Garland Publishing Inc., New 
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York NY, pp. 335-399.) DNA is organized into compact structures in the nucleus by interactions 
with various DNA-binding proteins such as histones and non-histone chromosomal proteins. 
DNA-specific nucleases, DNAses, partially degrade these compacted structures prior to DNA 
replication or transcription. DNA replication takes place with the aid of DNA helicases which 
5 unwind the double-stranded DNA helix, and DNA polymerases that duplicate the separated DNA 
strands. 

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream 

1 o regulatory regions of a gene in a sequence-specific manner, although some factors bind regulatory 

elements within or downstream of a gene's coding region. Transcription factors may bind to a specific 
region of DNA singly or as a complex with other accessory factors. (Reviewed in Le win, B. (1990) 
Genes IV , Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 
Many transcription factors incorporate DNA-binding structural motifs which comprise either a 

1 5 helices or B sheets that bind to the major groove of DNA. Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 
motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 

20 insufficient expression of tumor suppressor genes (Cleary, ML, (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 
inappropriate gene transcription, potentially contributing to malignancy. 

In addition, the immune system responds to infection or trauma by activating a cascade of 

2 5 events that coordinate the progressive selection, amplification, and mobilization of cellular defense 

mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is well 
documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 
30 infections (Isselbacher, K.J. et aL (1996) Harrison's Principles of Internal Medicine. 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

TYanscription of DNA into RNA also takes place in the nucleus catalyzed by RNA 
polymerases. Hiree types of RNA polymerase exist RNA polymerase I makes large ribosomal 
RNAs, while RNA polymerase m makes a variety of small, stable RNAs including 5S ribosomal 
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RNA and the transfer RNAs (tRNA). RNA polymerase n transcribes genes that will be translated 
into proteins. The primary transcript of RNA polymerase II is called heterogenous nuclear RNA 
(hnRNA), and must be further processed by splicing to remove non-coding sequences called introns. 
RNA splicing is mediated by small nuclear ribonucleoprotein complexes, or snRNPs, producing 
mature messenger RNA (mRNA) which is then transported out of the nucleus for translation into 
proteins. 
Nucleolus 

The nucleolus is a highly organized subcompartment in the nucleus that contains high 
concentrations of RNA and proteins and junctions mainly in ribosomal RNA synthesis and assembly 
(Alberts, et al. supra , pp. 379-382). Ribosomal RNA (rRNA) is a structural RNA that is complexed 
with proteins to form ribonucleoprotein structures called ribosomes. Ribosomes provide the platform 
on which protein synthesis takes place, 

Ribosomes are assembled in the nucleolus initially from a large, 45S rRNA combined with a 
variety of proteins imported from the cytoplasm, as weU as smaller, 5S rRNAs. Lata: processing of 
the immature ribosome results in formation of smaller ribosomal subunits which are'transported from 
the nucleolus to the cytoplasm where they are assembled into functional ribosomes. 
Endoplasmic Reticulum 

In eukaryotes, proteins are synthesized within the endoplasmic reticulum (ER), delivered from 
the ER to the Golgi apparatus for post-translational processing and sorting, and transported from the 
Golgi to specific intracellular and extracellular destinations. Synthesis of integral membrane proteins, 
secreted proteins, and proteins destined for the lumen of a particular organelle occurs on the rough 
endoplasmic reticulum (ER). The rough ER is so named because of the rough appearance in electron 
micrographs imparted by the attached ribosomes on which protein synthesis proceeds. Synthesis of 
proteins destined for the ER actually begins in the cytosol with the synthesis of a specific signal 
peptide which directs the growing polypeptide and its attached ribosome to the ER membrane where 
the signal peptide is removed and protein synthesis is completed. Soluble proteins destined for the 
ER lumen, for secretion, or for transport to the lumen of other organelles pass completely into the ER 
lumen. Transmembrane proteins destined for the ER or for other cell membranes are translocated 
across the ER membrane but remain anchored in the lipid bilayer of the membrane by one or more 
membrane-spanning a-helical regions. 

Translocated polypeptide chains destined for other organelles or for secretion also fold and 
assemble in the ERlumen with the aid of certain "resident" ER proteins. Protein folding in the ER is 
aided by two principal types of protein isomerases, protein disulfide isomerase (PDI), and peptidyl- 
prolyl isomerase (PPI). PDI catalyzes the oxidation of free sulfliydryl groups in cysteine residues to 
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form intramolecular disulfide bonds in proteins. PPI, an enzyme that catalyzes the isomerization of 
certain proline imide bonds in oligopeptides and proteins, is considered to govern one of the rate 
limiting steps in the folding of many proteins to their final functional conformation. The cyclophilins 
represent a major class of PPI that was originally identified as the major receptor for the 
5 immunosuppressive drug cyclosporin A (Handschumacher, R.E. et al. (1984) Science 226:544-547). 
Molecular "chaperones" such as BiP (binding protein) in the ER recognize incorrectly folded proteins 
as well as proteins not yet folded into their final form and bind to them, both to prevent improper 
aggregation between them, and to promote proper folding. 

The "N-linked" glycosylation of most soluble secreted and membrane-bound proteins by 

1 o oligosacchrides linked to asparagine residues in proteins is also performed in the ER. This reaction is 

catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. 
Golgi Apparatus 

The Golgi apparatus is a complex structure that lies adjacent to the ER in eukaryotic cells and 
serves primarily as a sorting and dispatching station for products of the ER (Alberts, et aL supra , pp. 

is 600-610). Additional posttranslational processing, principally additional glycosylation, also occurs in 
the Golgi. Indeed, the Golgi is a major site of carbohydrate synthesis, including most of the 
glycosaminoglycans of the extracellular matrix. N-linked oligosaccharides, added to proteins in the 
ER, are also further modified in the Golgi by the addition of more sugar residues to form complex N- 
hnked oligosaccharides. "O-linked" glycosylation of proteins also occur sin the Golgi by the addition 

20 of N-acetylgalactosamine to the hydroxyl group of a serine or threonine residue followed by the 
sequential addition of other sugar residues to the first. This process is catalyzed by a series of 
glycosyltransferases each specific for a particular donor sugar nucleotide and acceptor molecule 
(Lodish, H. et al. (1995) Molecular Cell Biology, W.H. Freeman and Co., New York NY, pp.700- 
708). In many cases; both N- and O-linked oligosaccharides appear to be required for the secretion of 

2 5 proteins or the movement of plasma membrane glycoproteins to the cell surface. 

The terminal compartment of the Golgi is the Trans-Golgi Network (TGN), where both 
membrane and lumenal proteins are sorted for their final destinatioa Transport (or secretory) vesicles 
destined for intracellular compartments, such as lysosomes, bud off of the TGN. Other transport 
vesicles bud off containing proteins destined for the plasma membrane, such as receptors, adhesion 

3 o molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 

enzymes. 
Vacuoles 

The vacuole system is a collection of membrane bound compartments in eukaryotic cells that 
functions in the processes of endocytosis and exocytosis. They include phagosomes, lysosomes, 
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endosomes, and secretory vesicles. Endocytosis is the process in ceDs of internalizing nutrients, solutes 
or small particles (pinocytosis) or large particles such as internalized receptors, viruses, bacteria, or 
bacterial toxins (phagocytosis). Exocytosis is the process of transporting molecules to the cell surface. 
It facilitates placement or localization of membrane-bound receptors or other membrane proteins and 
secretion of hormones, neurotransmitters, digestive enzymes, wastes, etc. 

A common property of all of these vacuoles is an acidic pH environment ranging from 
approximately pH 4.5-5.0. This acidity is maintained by the presence of a proton ATPase that uses the 
energy of ATP hydrolysis to generate an dectrochemical proton gradient across a membrane (Mellman, 
I. et al. (1986) Annu. Rev. Biochem 55:663-700). Eukaryotic vacuolar proton ATPase (vp-ATPase) is 
a multimeric enzyme composed of 3-10 different subunits. One of these subunits is a highly 
hydrophobic polypeptide of approximately 16 kDa that is similar to the proteolipid component of vp- 
ATPases from eubacteria, fungi, and plant vacuoles (Manuel, M. et ai. (1988) Proc. Natl Acad. Sci. 
USA 85:5521-5524). The 16 kDa proteolipid component is the major subunit of the membrane portion 
of vp-ATPase and functions in the transport of protons across the membrane. 
Lvsosomes 

Lysosomes are membranous vesicles containing various hydrolytic enzymes used for the 
controlled intracellular digestion of macromolecules. Lysosomes contain some 40 types of enzymes 
including proteases, nucleases, glycosidases, lipases, phospholipases, phosphatases, and sulfatases, all 
of which are acid hydrolases that junction at a pH of about 5. Lysosomes are surrounded by a unique 
membrane containing transport proteins that allow the final products of macromolecule degradation, 
such as sugars, amino acids, and nucleotides, to be transported to the cytosol where they may be 
either excreted or reutilized by the cell. A vp-ATPase, such as that described above, maintains the 
acidic environment necessary for hydrolytic activity (Alberts, supra, pp. 610-61 1). 
Endosomes 

Endosomes are another type of acidic vacuole that is used to transport substances from the 
cell surface to the interior of the cell in the process of endocytosis. Like lysosomes, endosomes have 
an acidic environment provided by a vp-ATPase (Alberts et al. supra, pp. 610-618). Two types of 
endosomes are apparent based on tracer uptake studies that distinguish their time of formation in the ' 
cell and their cellular location. Early endosomes are found near the plasma membrane and appear to 
function primarily in the recycling of internalized receptors back to the cell surface. Late endosomes 
appear later in the endocytic process close to the Golgi apparatus and the nucleus, and appear to be 
associated with delivery of endocytosed material to lysosomes or to the TGN whore they may be 
recycled. Specific proteins are associated with particular transport vesicles and their target 
compartments that may provide selectivity in targeting vesicles to their proper compartments. A 
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cytosolic prenylated GTP-binding protein, Rab, is one such protein. Rabs 4, 5, and 1 1 are associated 

with the early endosome* whereas Rabs 7 and 9 associate with the late endosome. 

Mitochondria 

Mitochondria are oval-shaped organdies comprising an outer membrane, a tightly folded 
5 inner membrane, an intennembrane space between the outer and inner membranes, and a matrix 
inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 
charged molecules to enter the intennembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 

10 Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 

to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 
complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH andFADH 2 
to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and Pj. 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 

is the citric acid cycled involving pyruvate dehydrogenase components, dihy drolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate Synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, ftunarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 

2 o GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 

dehydrogenases is coupled to the synthesis of ATP from ADP and V i by the FqFj ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the F^ ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. 
25 Peroxisomes 

Peroxisomes, like mitochondria, are a major site of oxygen utilization. They contain one or 
more enzymes, such as catalase and urate oxidase, that use molecular oxygen to remove hydrogen 
atoms from specific organic substrates in an oxidative reaction that produces hydrogen peroxide 
(Alberts, supra, pp. 574-577). Catalase oxidizes a variety of substrates including phenols, formic 

3 o acid, formaldehyde, and alcohol and is important in peroxisomes of liver and kidney cells for 

detoxifying various toxic molecules that enter the bloodstream. Another major function of oxidative 
reactions in peroxisomes is the breakdown of fatty acids in a process called P oxidation. P oxidation 
results in shortening of the alkyl chain of fatty acids by blocks of two carbon atoms that are converted 
to acetyl CoA and exported to the cytosol for reuse in biosynthetic reactions. 
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Also like mitochondria, peroxisomes import their proteins from the cytosol using a specific 
signal sequence located near the C -terminus of the protein. The importance of this import process is 
evident in the inherited human disease Zellweger syndrome, in which a defect in importing proteins 
into perixosomes leads to a perixosomal deficiency resulting in severe abnormalities in the brain, 
liver, and kidneys, and death soon after birth. One form of this disease has been shown to be due to a 
mutation in the gene encoding a perixosomal integral membrane protein called peroxisome assembly 
factor-1. 

The discovery of new human molecules satisfies a need in the art by providing new 
compositions which are useful in the diagnosis, study, prevention, and treatment of diseases associated 
with, as well as effects of exogenous compounds on, the expression of human molecules. 

SUMMARY OF THE INVENTION 
The present invention relates to nucleic acid sequences comprising human diagnostic and 
therapeutic polynucleotides (dithp) as presented in the Sequence Listing. The dithp uniquely identify 
genes encoding human structural, functional, and regulatory molecules. 

The invention provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:l-211; b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 -2 1 1 ; c) a 
polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and 
e) an RNA equivalent of a) through d). In one alternative, the polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-21 1 . In another 
alternative, the polynucleotide comprises at least 60 contiguous nucleotides of a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-21 1 ; b) a naturally occurring polynucleotide sequence having at least 90% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
21 1 ; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to 
b); and e) an RNA equivalent of a) through d). The invention further provides a composition for the 
detection of expression of human diagnostic and therapeutic polynucleotides comprising at least one 
isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 -2 1 1 ; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:l-21 1 ; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d); 
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and a detectable label. 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:l -21 1 ; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). 
The method comprises a) amplifying said target polynucleotide or a fragment thereof using polymerase 
chain reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-21 1 ; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). 
The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, undo* conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the 
presence or absence of said hybridization complex, and, optionally, if present, the amount thereof. In 
one alternative, the probe comprises at least 30 contiguous nucleotides. In another alternative, the 
probe comprises at least 60 contiguous nucleotides. 

The invention further provides a recombinant polynucleotide comprising a promoter sequence 
operably linked to an isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
21 1 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 -21 1 ; c) a polynucleotide 
sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA 
equivalent of a) through d). In one alternative, the invention provides a cell transformed with the 
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 
comprising the recombinant polynucleotide. In a further alternative, the invention provides a method 
for producing a human diagnostic and therapeutic polypeptide, the method comprising a) culturing a 
cell under conditions suitable for expression of the human diagnostic and therapeutic polypeptide, 
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wherein said ceQ is transformed with the recombinant polynucleotide, and b) recovering the human 
diagnostic and therapeutic polypeptide so expressed. 

The invention also provides a purified human diagnostic and therapeutic polypeptide (DITHP) 
encoded by at least one polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO: 1 -21 1 . Additionally, the invention provides an isolated antibody which 
specifically binds to the human diagnostic and therapeutic polypeptide. The invention further provides 
a method of identifying a test compound which specifically binds to the human diagnostic and 
therapeutic polypeptide, the method comprising the steps of a) providing a test compound; b) combining 
the human diagnostic and therapeutic polypeptide with the test compound for a sufficient time and 
undo: suitable conditions for binding; and c) detecting binding of the human diagnostic and therapeutic 
polypeptide to the test compound, thereby identifying the test compound which specifically binds the 
human diagnostic and therapeutic polypeptide. 

The invention further provides a microarray wherein at least one element of the microarray is 
an isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally occurring polynucleotide sequence 
having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting 
of SEQ ID NO:l-21 1; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d). The invention also provides 1 a method 
for generating a transcript image of a sample which contains polynucleotides. The method comprises a) 
labeling the polynucleotides of the sample, b) contacting the elements of the microarray with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a hybridization complex, 
and c) quantifying the expression of the polynucleotides in the sample. 

Additionally, the invention provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally occurring polynucleotide sequence 
having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting 
of SEQ ID NO:l-21 1 ; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) exposing a 
sample coiqprising the target polynucleotide to a compound, and b) detecting altered expression of the 
target polynucleotide, and c) comparing the expression of the target polynucleotide in the presence of 
varying amounts of the compound and in the absence of the compound. 

The invention further provides a method for assessing toxicity of a test compound, said method 
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comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
211; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1 ; iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex 
is formed between said probe and a target polynucleotide in the biological sample, said target 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-211; ii) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 1-21 1 ; iii) a polynucleotide sequence 
complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent 
of i)-iv), and alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence 
selected from the group consisting of i-v above; c) quantifying the amount of hybridization complex; 
and d) comparing the amount of hybridization complex in the treated biological sample with the amount . 
of hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound 
The invention further provides an isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of a) an amino acid sequence selected from the group consisting of 
SEQ ID NO:212-422, b) a naturally occurring amino acid sequence having at least 90% sequence 
identity to an amino acid sequence selected from the group consisting of SEQ ID NO:212-422, c) a 
biologically active fragment of an amino add sequence selected from the group consisting of SEQ ID 
NO:21 2-422, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ED NO:21 2-422. In one alternative, the invention provides an isolated polypeptide 
comprising the amino acid sequence of SEQ ID NO:212-422. 

DESCRIPTION OF THE TABLES 

Table 1 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (tenjplate IDs) corresponding to the polynucleotides of the present invention, along with their 
GeuBank hits (GI Numbers), probability scores, and functional annotations corresponding to the 
GenBankhits. 

Table 2 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
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numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. Hie reading frames of the polynucleotide segments and the Pf am hits, Pf am 
descriptions, and E- values corresponding to the polypeptide domains encoded by the polynucleotide 
segments are indicated 

Table 3 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. The reading frames of the polynucleotide segments are shown, and the 
polypeptides encoded by the polynucleotide segments constitute either signal peptide (SP) or 
transmembrane (TM) domains, as indicated. The membrane topology of the encoded polypeptide 
sequence is indicated, the N-terminus (N) listed as being oriented to either the cytosolic (in) or non- 
cytosolic (out) side of the cell membrane or organelle. 

Table 4 shows the sequence identification numbers (SEQ ED NO:s) corresponding to the 
polynucleotides of the present invention, along with component sequence identification numbers 
(component IDs) corresponding to each template. The component sequences, which were used to 
assemble the template sequences, are defined by the indicated "start" and "stop" nucleotide positions 
along each template. 

Table 5 shows the tissue distribution profiles for the templates of the inventioa 

Table 6 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the 
polypeptides of the present invention, along with the reading frames used to obtain the polypeptide 
segments, the lengths of the polypeptide segments, the "start" and "stop" nucleotide positions of the 
polynucleotide sequences used to define the encoded polypeptide segments, the GenBankhits (GI 
Numbers), probability scores, and functional annotations corresponding to the GenBank hits. 

Table 7 summarizes the bioinformatics tools which are useful for analysis of the 
polynucleotides of the present invention. The first column of Table 7 lists analytical tools, programs, 
and algorithms, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). 

DETAILED DESCRIPTION OF THE INVENTION 

Before the nucleic acid sequences and methods are presented, it is to be understood that this 
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invention is not limited to the particular machines, methods, and materials described Although 
particular embodiments are described, machines, methods, and materials similar or equivalent to these 
embodiments may be used to practice the invention. The preferred machines, methods, and materials 
s et forth are not intended to limit the s cop e of the invention which is limited only b y the appended 
5 claims. 

The singular forms "a", "an", and "the" include plural reference unless the context clearly 
dictates otherwise. All technical and scientific terms have the meanings commonly understood by one 
of ordinary skill in the art All publications are incorporated by reference for the purpose of describing 
and disclosing the cell lines, vectors, and methodologies which are presented and which might be used in 
10 connection with the invention. Nothing in the specification is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

Definitions 

As used herein, the lower case "dithp" refers to a nucleic acid sequence, while the upper case 
1 5 "DYTHF' refers to an amino acid sequence encoded by dithp. A "full-length" dithp refers to a nucleic 
add sequence containing the entire coding region of a gene endogenously expressed in human tissue. 

"Adjuvants" are materials such as Freund's adjuvant, mineral gels (aluminum hydroxide), and 
surface active substances (lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrqphenol) which may be administered to increase a host's immunological 
20 response. 

"Allele" refers to an alternative form of a nucleic acid sequence. Alleles result from a 
"mutation," a change or an alternative reading of the genetic code. Any given gene may have none, one, 
or many allelic forms. Mutations which give rise to alleles include deletions, additions, or substitutions 
of nucleotides. Each of these changes may occur alone, or in combination with the others, one or more 

2 5 times in a given nucleic acid sequence. The present invention encompasses allelic dithp. 

"Amino acid sequence" refers to a peptide, a polypeptide, or a protein of either natural or 
synthetic origin. The amino acid sequence is not limited to the complete, endogenous amino acid 
sequence and may be a fragment, epitope, variant, or derivative of a protein expressed by a nucleic acid 
sequence* 

3 o "Amplification" refers to the production of additional copies of a sequence and is carried out 

using polymerase chain reaction (PCR) technologies well known in the art. 

"Antibody" refers to intact molecules as well as to fragments thereof, such as Fab, F(ab% and 
Fv fragments, which are capable of binding the epitopic determinant. Antibodies that bind DITHP 
polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of 
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interest as the immunizing antigen. The polypeptide or peptide used to immunize an animal (e. g. , a 
mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and 
can be conjugated to a carrier protein if desired Commonly used carriers that are chemically coupled 
to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The 
coupled peptide is then used to immunize the animal. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
sequence. The antisense sequence may include DNA, RNA, or any nucleic acid mimic or analog such 
as peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2'-rnethoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5 -methyl cytosine, 2'-deoxyuracil, or 7-deaza~2'-deoxyguanosine. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
sequence. The antisense sequence can be DNA, RNA, or any nucleic acid mimic or analog. 

"Antisense technology" refers to any technology which relies on the specific hybridization of an 
antisense sequence to a target sequence. 

A "bin" is a portion of computer memory space used by a compute program for storage of 
data, and bounded in such a manner that data stored in a bin may be retrieved by the program. 

"Biologically active" refers to an amino acid sequence having a structural, regulatory, or 
biochemical function of a naturally occurring amino acid sequence. 

"Clone joining" is a process for combining gene bins based upon the bins' containing sequence 
. information from the same clone. The sequences may assemble into a primary gene transcript as well 
as one or more splice variants. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing (5-A-G-T-3' pairs with its complement 3-T-C-A-5'). 

A "component sequence" is a nucleic acid sequence selected by a computer program such as 
PHRED and used to assemble a consensus or template sequence from one or more component 
sequences. 

A "consensus sequence" or "template sequence" is a nucleic acid sequence which has been 
assembled from overlapping sequences, using a computer program for fragment assembly such as the 
GELVIEW fragment assembly system (Gaieties Compute Group (GCG), Madison WI) or using a 
relational database management system (RDMS). 

"Conservative amino acid substitutions" are those substitutions that, when made, least interfere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
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which may be substituted for an original amino acid in a protein and which are regarded as conservative 
substitutions. 





Original Residue 


Conservative Substitution 


5 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


10 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




lie 


Leu, Val 


15 


Leu 


lie, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, lie 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 


20 


Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


He, Leu, Thr 



Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in 
the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site* or (c) the bulk of the side chain. 

"Deletion" refers to a change in either a nucleic or amino acid sequence in which at least one 
3 o nucleotide or amino acid residue, respectively, is absent. 

"Derivative" refers to the chemical modification of a nucleic acid sequence, such as by 
replacement of hydrogen by an alkyl, acyl, amino, hydroxyl, or other group. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

3 5 lvalue" refers to the statistical probability that a match between two sequences occurred by 

chance. 

A "fragment" is a unique portion of dithp or DITHP which is identical in sequence to but 
shorter in length than the parent sequence. A fragment may comprise up to the entire length of the 
defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise 

4 o from 10 to 1000 contiguous amino acid residues or nucleotides. A fragment used as a probe, primer, 
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antigen, therapeutic molecule, or for other proposes, may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 
60, 75, 100, 150, 250 or at least 500 contiguous amino acid residues or nucleotides in length. 
Fragments may be preferentially selected from certain regions of a molecule. For example, a 
polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 
250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain defined sequence. 
Clearly these lengths are exemplary, and any length that is supported by the specification, including the 
Sequence Listing and the figures, may be encompassed by the present embodiments. 

A fragment of dithp comprises a region of unique polynucleotide sequence that specifically 
identifies dithp, for example, as distinct from any other sequence in the same genome. A fragment of 
dithp is useful, for example, in hybridization and amplification technologies and in analogous methods 
that distinguish dithp from related polynucleotide sequences. The precise length of a fragment of dithp 
and the region of dithp to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment 

A fragment of DITHP is encoded by a fragment of dithp. A fragment of DITHP comprises a 
region of unique amino acid sequence that specifically identifies DITHP. For example, a fragment of 
DITHP is useful as an immunogenic peptide for the development of antibodies that specifically 
recognize DITHP. The precise length of a fragment of DITHP and the region of DITHP to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

A "full length" nucleotide sequence is one containing at least a start site for translation to a 
protean sequence, followed by an open reading frame and a step site, and encoding a 'full length" 
polypeptide. 

"Hit" refers to a sequence whose annotation will be used to describe a given template. Criteria 
for selecting the top hit are as follows: if the template has one or more exact nucleic add matches, the 
top hit is the exact match with highest percent identity. If the template has no exact matches but has 
significant protein hits, the top hit is the protein hit with the lowest E- value. If the template has no 
significant protein hits, but does have significant non-exact nucleotide hits, the top hit is the nucleotide 
hit with the lowest E-value. 

"Homology" refers to sequence similarity either between a reference nucleic acid sequence and 
at least a fragment of a dithp or between a reference amino acid sequence and a fragment of a DITHP. 

"Hybridization" refers to the process by which a strand of nucleotides anneals with a 
complementary strand through base pairing. Specific hybridization is an indication that two nucleic 
acid sequences share a high degree of identity. Specific hybridization complexes form under defined 
annealing conditions, and remain hybridized after the "washing" step. The defined hybridization 
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conditions include the annealing conditions and the washing step(s), the latter of which is particularly 
important in determining the stringency of the hybridization process, with more stringent conditions 
allowing less non-specific binding, Le, binding between pairs of nucleic acid probes that are not 
perfectly matched Permissive conditions for annealing of nucleic acid sequences are routinely 
5 determinable and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency. 

GeneraUy, stringency of hybridization is expressed with reference to the temperature under 
which the wash step is carried out Generally, such wash temperatures are selected to be about 5°C to 
20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic strength and 
10 pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions for 
nucleic acid hybridization is wdl known and can be found in Sambrook et ai., 1989, Molecular 
Cloning: A Laboratory Manual, 2 nd ed, vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9, 

1 5 High stringency conditions for hybridization between polynucleotides of the present invention 

include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1% SDS, for 1 hour. 
Alternatively, ten^eratures of about 65°C, 60°C, or 55°C may be used SSC concentration may be 
varied from about 0.2 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking reagents, 
are used to block non-specific hybridizatioa Such blocking reagents include, for instance, denatured 

2 o salmon sperm DNA at about 100-200 jug/ml. Useful variations on these conditions will be readily 
apparent to those skilled in the art. Hybridization, particularly under high stringency conditions, may 
be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative 
of a similar role for the nucleotides and their resultant proteins. 

Other parameters, such as temperature, salt concentration; and detergent concentration may be 

2 5 varied to achieve the desired stringency. Denaturants, such as formamide at a concentration of about 

35-50% v/v, may also be used under particular circumstances, such as RNA:DNA hybridizations. 
Appropriate hybridization conditions are routinely determinable by one of ordinary skill in the art. 

"Immunogenic" describes the potential for a natural, recombinant, or synthetic peptide, epitope, 
polypeptide, or protein to induce antibody production in appropriate animals, cells, or cell lines. 

3 o 'Insertion" or "addition" refers to a change in either a nucleic or amino acid sequence in which 

at least one nucleotide or residue, respectively, is added to the sequence. 

"Labeling" refers to the covalent or noncovalent joining of a polynucleotide, polypeptide, or 
antibody with a reporter molecule capable of producing a detectable or measurable signal. 
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"Microarray" is any arrangement of nucleic acids, amino acids, antibodies, etc., on a substrate. 
The substrate may be a solid support such as beads, glass, paper, nitrocellulose, nylon, or an 
appropriate membrane. 

'Tinkers" are short stretches of nucleotide sequence which may be added to a vector or a dithp 
to create restriction endonuclease sites to facilitate cloning. "Polylinkers" are engineered to incorporate 
multiple restriction enzyme sites and to provide for the use of enzymes which leave 5' or 3 f overhangs 
(e.g., BamHI, EcoRI, and HindTTT) and those which provide blunt ends (ag., EcoRV, SnaBI, and StuI). 

"Naturally occurring" refers to an endogenous polynucleotide or polypeptide that may be 
isolated from viruses or prokaryotic or eukaryotic cells. 

"Nucleic acid sequence" refers to the specific order of nucleotides joined by phosphodiester 
. bonds in a linear, polymeric arrangement Depending on the number of nucleotides, the nucleic acid 
. sequence can be considered an oligomer, oligonucleotide, or polynucleotide. The nucleic acid can be 
DN A, RNA, or any nucleic acid analog, such as PNA, may be of genomic or synthetic origin, may be 
either double-stranded or single-stranded, and can represent either the sense or antisense 
(complementary) strand. 

"Oligomer" refers to a nucleic acid sequence of at least about 6 nucleotides and as many as 
about 60 nucleotides, preferably about 15 to 40 nucleotides, and most preferably between about 20 and 
30 nucleotides, that may be used in hybridization or amplification technologies. Oligomers may be used 
as, e.g., primers for PCR, and are usually chemically synthesized. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence, Generally, operably linked DN A sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

'Peptide nucleic acid" (PNA) refers to a DNA mimic in which nucleotide bases are attached to 
a pseudopeptide backbone to increase stability. PNAs, also designated antigeae agents, can prevent 
gene expression by targeting complementary messenger RNA 

The phrases "percent identity" and "% identity", as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence 
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alignment program. TTiis program is part of the LASERGENE software package* a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and Sharp, P.M. (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
5 Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default Percent ideaitity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Centex for Biotechnology Information (NCBI) Basic Local Alignment Search 
10 Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), whichis available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

hltp7/ww.ncbLnlm.nih ; gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to determine alignment between a known polynucleotide 
sequence and other sequences on a variety of databases. Also available is a tool called "BLAST 2 
1 5 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at httpy/www.ncbi.nlm. nih.gov/gorf/bl2/. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 

2 o 2.0.9 (May-07- 1999) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSVM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
25 Gap x drop z off: 50 

Expect: 10 
Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 

3 o defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 

the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
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shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage 
identity may be measured 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases "percent identity" and "% identity", as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrqphobicity and acidity of the substituted 
residue, thus preserving the structure (and therefore function) of toe folded polypeptide; 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty^, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the ''percent similarity" 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalty 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ED number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
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supported by the sequences shown heron, in figures or Sequence Listings, may be used to describe a 
length ova: which percentage identity may be measured. 

"Post-translational modification" of a DITHP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage* and other modifications known in the 
5 art These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu and the DITHP. 

"Probe" refers to dithp or fragments thereof, which are used to detect identical, allelic or related 
nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable 
label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent 
10 agents, and enzymes. "Primers" are short nucleic acids, usually DNA oligonucleotides, which may be 
annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended 
along the target DNA strand by a DNA polymerase enzyme Prima' pahs can be used for amplification 
(and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
15 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, or 
at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the * 
specification, including the figures and Sequence Listing, may be used. 

2 o Methods for preparing and using probes and primers are described in the references, for 

example Sambrook et al, 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubd et al., 1987, Current Protocols in Molecular Biology, 
Greene Publ. Assoc. & Wfley-Intersdences, New York NY; Innis et al., 1990, PCR Protocols. A Guide 
to Methods and Applications. Academic Press, San Diego C A PCR primer pairs can be derived from 
25 a known sequence, for example, by using computer programs intended for that purpose such as Primer 
(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example* OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 

3 o nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 

programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Cento, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome- wide scope. The Primer3 prima- selection 
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program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a 44 mispriming library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
5 their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thffeby allowing selection of primers that 
hybridize to either the most conserved or least censored regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
10 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

15 '"Purified" refers to molecules, dither polynucleotides or polypeptides that are isolated or 

separated from their natural environment and are at least 60% free, preferably at least 75% free, and 
most preferably at least 90% free from other compounds with which they are naturally associated 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
20 This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic adds, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . Hie tern recombinant includes nucleic acids that have 
been altered soMy by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
2 5 Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

"Regulatory dement" refers to a nucleic acid sequence from nontranslated regions of a gene, 
30 and includes enhancers, promoters, introns, and 3' untranslated regions, which interact with host 
proteins to cany out or regulate transoiption or translation 

"Reporter" molecules are chemical or biochemical moieties used for labeling a nucleic acid, an 
amino acid, or an antibody. They include radionuclides; enzymes; fluorescent, chemiluminescent, or 
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cbromogenic agents; substrates; cof actors; inhibitors; magnetic particles; and other moieties known in 
the art 

An "RNA equivalent," in reference to a DNA sequence* is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
5 nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

"Sanq>le" is used in its broadest sense. Samples may contain nucleic or amino acids, 
antibodies, or other materials, and may be derived from any source (e.g., bodily fluids including, but not 
limited to, saliva, blood, and urine; chromosome(s), organelles, or membranes isolated from a cell; 

1 o genomic DNA, RNA, or cDNA in solution or bound to a substrate; and cleared ceils or tissues or blots 

or imprints from such cells or tissues). 

"Specific binding" or "specifically binding" refers to the interaction between a protein or 
peptide and its agonist, antibody, antagonist, or other binding partner. The interaction is dependent 
upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
15 recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
presence of a polypeptide containing epitope A, or the presence of free unlabeled A, in a reaction 
containing free labeled A and the antibody will reduce the amount of labeled A that binds to the 
antibody. 

"Substitution" refers to the replacement of at least one nucleotide or amino acid by a different 

2 o nucleotide or amino acid. 

"Substrate" retes to any suitable rigid or semi-rigid support including, e.g., membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gds, tubing, plates, polymers, 
micrqparticles or capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

25 A "transcript image" refers to the collective patten of gene expression by a particular tissue or 

cell type under given conditions at a given time. 

"Transformation" refers to a process by which exogenous DNA enters a recipient cell. 
Transformation may occur under natural or artificial conditions using various methods well known in 
the art Transformation may rely on any known method for the insertion of foreign nucleic acid 

30 sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being 
transformed. 

'Transformants" include stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as 
cells which transiently express inserted DNA or RNA. 

125 



WO 01/62927 



PCTAJS01/06059 



A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic add 
introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
5 by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant 
virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, 
but rather is directed to the introduction of a recombinant DN A molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 

10 known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and provided in 
references such as Sambrook et aL (1989), supra . 

A 'Variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 25% sequence identity to the particular nucleic acid sequence ova: a certain length of one of the 

is nucleic, acid sequences using blasto with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 30%, at least 
50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or even at least 98% or 
greater sequence identity over a certain defined length. The variant may result in "conservative" amino , 
acid changes which do not affect structural and/or chemical properties. A variant may be described as, 

20 for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or lesser 
number of polynucleotides due to alternate splicing of exons during mRN A processing. The 
corresponding polypeptide may possess additional functional domains or lack domains that are present 
in the reference molecule, Species variants are polynucleotide sequences that vary from one species to 

25 another. The resulting polypeptides generally will have significant amino acid identity relative to each 
other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence of 
SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a 

30 disease state. 

In an alternative, variants of the polynucleotides of the present invention may be generated 
through recombinant methods. One possible method is a DNA shuffling technique such as 
MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et aL (1999) Nat Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
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Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or improve 
the biological properties of DITHP, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
sdection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60.%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity ova: a certain defined length of one of the polypeptides. 

THE INVENTION 

In a particular embodiment, cDNA sequences derived from human tissues and cell lines were 
aligned based on nucleotide sequence identity and assembled into "consensus" or "template" sequences 
which are designated by the template identification numbers (template IDs) in column 2 of Table 1 . 
The sequence identification numbers (SEQ ID NO:s) corresponding to the template IDs are shown in 
column 1 . The template sequences have similarity to GenBank sequences, or "hits," as designated by 
the GI Numbers in column 3. The statistical probability of each GenBank hit is indicated by a 
probability score in column 4, and the functional annotation corresponding to each GenBank hit is listed 
in column 5. 

The invention incorporates the nucleic acid sequences of these templates as disclosed in the 
Sequence Listing and the use of these sequences in the diagnosis and treatment of disease states 
characterized by defects in human molecules. The invention further utilizes these sequences in 
hybridization and amplification technologies, and in particular, in technologies which assess gene 
expression pattens correlated with specific cells or tissues and their responses in vivo or in vitro to 
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pharmaceutical agents, toxins, and other treatments. In this manner, the sequences of the present 
invention are used to develop a transcript image for a particular cell or tissue. 

Derivation of Nucleic Acid Sequences 

cDN A was isolated from libraries constructed using RNA derived from normal and diseased 
human tissues and cell lines. The human tissues and cell lines used for cDN A library construction ware 
selected from a broad range of sources to provide a diverse population of cDNAs representative of gene 
transcription throughout the human body. Descriptions of the human tissues and cell lines used for 
cDNA library construction are provided in the LIFESEQ database (Incyte Genomics, Inc. (Incyte), Palo 
Alto CA). Human tissues were broadly selected from, for example, cardiovascular, dermatologic, 
endocrine, gastrointestinal, hematopoietic/immune system, musculoskeletal, neural, reproductive, and 
urologic sources. 

Cell lines used for cDNA library construction ware derived from, for example, leukemic cells, 
teratocartinomas, neuroepitheliomas, cervical carcinoma, lung fibroblasts, and endothelial cells. Such 
cell lines include, for example, THP-1, Jurkat, HUVEC, hNT2, WI38, HeLa, and other cell lines 
commonly used and available from public depositories (American Type Culture Collection, Manassas 
VA). Prior to roRNA isolation, cell lines were untreated, treated with a pharmaceutical agent such as 
5 -aza-2'-deoxycytidine> treated with an activating agent such as lipopolysaccharide in the case of 
leukocytic cell lines, or, in the case of endothelial cell lines, subjected to shear stress. 

Sequencing of the cDNAs 

Methods for DNA sequencing are well known in the art. Conventional enzymatic methods 
enqploy the Klenow fragment of DNA polymerase I, SEQUENASE DNA polymerase (U.S. 
Biochemical Corporation, Cleveland OH), Taq polymerase (Applied Biosystems, Foster City CA), 
thermostable 17 polymerase (Amersham Pharmacia Biotech, Inc. (Amersham Pharmacia Biotech), 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found in 
theELONGASE amplification system (Life Technologies Inc. (Life Technologies), Gaithersburg MD), 
to extend the nucleic acid sequence from an oligonucleotide primer annealed to the DNA template of 
interest Methods have been developed for the use of both single^stranded and double-stranded 
templates. Chain termination reaction products may be electrophoresed on urea-polyacrylamide gds 
and detected either by autoradiography (for radioisotope-labeled nucleotides) or by fluorescence (for 
fluorophore-labeled nucleotides). Automated methods for mechanized reaction preparation, sequencing, 
and analysis using fluorescence detection methods have been developed Machines used to prepare 
cDNAs for sequencing can include the MICROLAB 2200 liquid transfer system (Hamilton Company 

128 



WO 01/62927 



PCT/US01/06059 



(Hamilton), Reno NV), Peltier thermal cycler (PTC200; MJ Research, Inc. (MJ Research), Watertown 
MA), and ABI CATALYST 800 thermal cycler (Applied Biosystems). Sequencing can be carried out 
using, for example, the ABI 373 or 377 (Applied Biosystems) or MEGABACE 1000 (Molecular 
Dynamics, Inc. (Molecular Dynamics), Sunnyvale CA) DNA sequencing systems, or other automated 
5 and manual sequencing systems well known in the art 

The nucleotide sequences of the Sequence Listing have been prepared by current, state-of-the- 
art, automated methods and, as such, may contain occasional sequencing errors or unidentified 
nucleotides. Such unidentified nucleotides are designated by an N. These infrequent unidentified bases 
do not represent a hindrance to practicing the invention for those skilled in the art Several methods 
l o employing standard recombinant techniques may be used to correct errors and complete the missing 
sequence information. (See, e.g., those described in Ausubel, F.M. et al. (1997) Short Protocols in 
Molecular Biology., John Wiley & Sons, New York NY; and Sambrook, J. et ai. (1989) Molecular 
Cloning, A Laboratory Manual , Cold Spring Harbor Press. Plainview NY.) 

15 Assembly of cDNA Sequences 

Human polynucleotide sequences may be assembled using programs or algorithms well known 
in the art Sequences to be assembled are related, wholly or in part, and may be derived- from a single 
or many different transcripts. Assembly of the sequences can be performed using such programs as 
PHRAP (Phils Revised Assembly Program) and the GEL VIEW fragment assembly system (GCG), or 

20 other methods known in the art. 

Alternatively, cDNA sequences are used as "component" sequences that are assembled into 
"template" or "consensus" sequences as follows. Sequence chromatograms are processed, verified, and 
quality scores are obtained using PHRED. Raw sequences are edited using an editing pathway known 
as Block 1 (See, e.g., the LBFESEQ Assembled User Guide, Incyte Genomics, Palo Alto, CA). A series 

25 of BLAST comparisons is performed and low-information segments and repetitive dements (e.g., 
dinucleotide repeats, Alu repeats, etc.) are replaced by "n's", or masked, to prevent spurious matches. 
Mitochondrial and ribosomal RNA sequences are also removed. The processed sequences are then 
loaded into a relational database management system (RDMS) which assigns edited sequences to 
existing templates, if available. When additional sequences are added into the RDMS, a process is 

3 o initiated which modifies existing templates or creates new templates from works in progress (Le. , 

nonfinal assembled sequences) containing queued sequences or the sequences themselves. After the new 
sequences have been assigned to templates, the templates can be merged into bins. If multiple templates 
exist in one bin, the,bin can be split and the templates reannotated. 
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Once gene bins have been generated based upon sequence alignments,* bins are "clone joined" 
based upon clone information. Clone joining occurs when the 5' sequence of one clone is present in one 
bin and the 3' sequence from the same done is present in a different bin, indicating that the two bins 
should be merged into a single bin. Only bins which share at least two different clones are merged. 

A resultant template sequence may contain either a partial or a full length open reading frame, 
or all or part of a genetic regulatory dement This variation is due in part to the fact that the full length 
cDNAs of many genes are several hundred, and sometimes several thousand, bases in length. With 
current technology, cDNAs comprising the coding regions of large genes cannot be cloned because of 
vector limitations, incomplete reverse transcription of the mRNA, or incomplete "second strand" 
synthesis. Template sequences may be extended to include additional contiguous sequences derived 
from the parent RNA transcript using a variety of methods known to those of skill in the art. Extension 
may thus be used to achieve the full length coding sequence of a gene. 

Analysis of the cDNA Sequences 

The cDNA sequences are analyzed using a variety of programs and algorithms which are well 
known in the art. (See, e.g., Ausubel, 1997, supra . Chapter 7.7; Meyers, R.A (Ed.) (1995) Molecular 
Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853; and Table 7.) These analyses 
comprise both reading frame determinations, e.g. , based on triplet codon periodicity for particular 
organisms (Fickett, J.W. (1982) Nucleic Acids Res. 10:5303-5318); analyses of potential start and stop 
codons; and homology searches. 

Computer programs known to those of skill in the art for performing computer-assisted 
searches for amino acid and nucleic acid sequence similarity, include, for example, Basic Local 
Alignment Search Tool (BLAST; Altschul, S.F. (1993) J. Mol. Evol. 36:290-300; Altschul, S.F. et al. 
(1990) J. Mol. Biol. 215:403-410). BLAST is especially useful in determining exact matches and 
comparing two sequence fragments of arbitrary but equal lengths, whose alignment is locally m a xim al 
and for which the alignment score meets or exceeds a threshold or cutoff score set by the user (Karlin, 
S. et aL (1988) Proc. Natl. Acad Sci. USA 85:841-845). Using an appropriate search tool (e>g., 
BLAST or HMM), GenBank, SwissProt, BLOCKS, PFAM and other databases may be searched for 
sequences containing regions of homology to a query dithp or DITHP of the present invention. 

Other approaches to the identification, assembly, storage, and display of nucleotide and 
polypeptide sequences are provided in "Relational Database for Storing Biomolecule Information," 
U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/81 1,758, filed March 6, 1997; and "Relational Database and System for 
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Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807. filed March 4, 1998, 
all of which are incorporated by reference heron in their entirety. 

Protein hierarchies can be assigned to the putative encoded polypeptide based on, e.g. , motif, 
BLAST, or biological analysis. Methods for assigning these hierarchies are described, for example, in 
5 "Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," 
U.S.S.N. 08/812,290, filed March 6, 1997, incorporated herein by reference. 

Identification of Human Diagnostic and Therapeutic Molecules Encoded by dithp 

The identities of the DITHP encoded by the dilhp of the present invention were obtained by 

10 analysis of the assembled cDNA sequences. 

SEQ ID NO:212, SEQ ID NO:213, SEQ ED NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID NO:217 ( SEQ ID NO;218, SEQ ID NO:219. SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, and SEQ ID NO:223, encoded by SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ED NO:9, SEQ DD NO: 10, 

15 SEQEDNO:ll, and SEQ ED NO:12, respectively, are, for example, human enzyme molecules. 

SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ED NO:227, and SEQ ED NO:228, 
encoded by SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, and SEQ ED NO:17, 
respectively, are, for example, receptor molecules. 

SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ED N0.232, SEQ ED NO:233, 

20 SEQ ED N0:234, SEQ ED NO:235, SEQ ED N0:236, SEQ DD NO:237, SEQ DD NO:238, SEQ DD 
NO:239, SEQ ID NO:240, and SEQ DD NO:241, encoded by SEQ DD NO:18, SEQ ID NO: 19, SEQ 
ID NO:20, SEQ DD NO:21, SEQ DD NO:22, SEQ DD NO:23, SEQ ID NO:24, SEQ DD NO:25, SEQ 
ID NO:26, SEQ ID NO;27, SEQ DD NO:28, SEQ DD NO:29, and SEQ ID NO:30, respectively, are, 
for example, intracellular signaling molecules. 

25 SEQ DD NO:242, SEQ DD NO:243, SEQ ID NO:244, SEQ DD NO:245, SEQ DD NO:246, 

SEQ DD NO:247, SEQ DD NO:248, SEQ DD NO:249, SEQ DD NO:250, SEQ DD NO:251, SEQ DD 
NO:252, SEQ ID NO:253, SEQ DD NO:254, SEQ DD NO:255. SEQ ID NO:256, SEQ DD NO:257, 
SEQ ID NO:258, SEQ DD NO:259, SEQ DD NO:260, SEQ DD NO:261, SEQ DD NO:262, SEQ DD 
NO:263, SEQ ID NO:264, SEQ DD NO:265, SEQ DD NO:266, SEQ DD NO:267, SEQ DD NO:268, 

30 SEQ DD NO:269, SEQ DD NO:270, SEQ DD NO:271, SEQ ED NO:272, SEQ DD NO:273, SEQ DD 
NO:274, SEQ ID NO:275, SEQ DD NO:276, SEQ DD NO:277, SEQ ID NO:278, SEQ DD NO:279, 
SEQ DD NO:280, SEQ DD NO:281, SEQ DD NO:282, SEQ DD NO:283, SEQ DD NO:284, SEQ DD 
NO:285, SEQ DD NO:286, SEQ DD N0.287, SEQ DD NO:288, SEQ DD NO:289, SEQ DD NO:290, 
SEQ DD NO:291, SEQ DD NO:292, SEQ DD NO:293, SEQ DD N0.294, SEQ DD NO:295. SEQ DD 
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NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, 
SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID 
NO:307, SEQ ID NO.308, and SEQ ID NO:309, encoded by SEQ ID NO:31, SEQ ID NO:32, SEQ 
ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ED NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ 
5 ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ 
ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ 
ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ 
ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ 
ID NO:63, SEQ ID NO:<54, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ 
10 ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ 

t 

ID NO:75, SEQ ID NO:76, SEQ ID N0.77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ 
ID NO:81, SEQ ID NO:82, SEQ ID NO:83. SEQ ID NO:84 ; SEQ ID NO:85, SEQ ID NG:86, SEQ 
ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ 
ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID N0.96, SEQ ID NO:97, and SEQ ID NO:98, 

l s respectively, are, for example, transcription factor molecules. 

SEQ ID NO:310. SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:314, 
SEQ ID NO:315, SEQ ID N0.316, and SEQ ID NO:317, encoded by SEQ ID NO:99, SEQ ID 
NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO: 103, SEQ ID NO:104, SEQ ID NO.105, 
and SEQ ID NO: 106, respectively, are, for example, membrane transport molecules. 

20 SEQ ID NO:318, SEQ ID N0.319, SEQ ID NO:320, SEQ ED NO:321, SEQ ID NO:322, 

SEQ ID N0.323, SEQ ID N0.324, SEQ ED NO:325, SEQ ED N0.326, and SEQ ED NO:327, encoded 
by SEQ ED NO:107, SEQ ED NO:108, SEQ ED NO:109, SEQ ED NO:110, SEQ ID NO:lll, SEQ ED 
NO:l 12, SEQ ED NO:l 13, SEQ ID NO:l 14, SEQ ED NO:l 15, and SEQ ED NO:l 16, respectively, are, 
for example, protein modification and maintenance molecules. 

25 SEQ ED NO:328, SEQ ED N0.329, SEQ ED NO:330, SEQ ID NO:331 , SEQ ID NO:332, 

SEQ ID NO:333, SEQ ED NO:334, SEQ ID NO:335, SEQ ED NO:336, SEQ ED N0.337, SEQ ED 
NO:338, SEQ ED N0:339, SEQ ED NO:340, and SEQ ID N0:341, encoded by SEQ ID NO:117, SEQ 
ED NO.l 18, SEQ ED NO:119, SEQ ED NO.120, SEQ ED N0.121, SEQ ED N0.122, SEQ ED NO:123, 
SEQ ED NO:124, SEQ ED NO:125, SEQ ED NO:126, SEQ ED NO:127, SEQ ED NO:128, SEQ ID 

3 o NO:129, and SEQ ED NO:130, respectively, are, for example, nucleic acid synthesis and modification 
molecules. 

SEQ ED NO:342, encoded by SEQ ED NO:131 is, for example, an adhesion molecule. 
SEQ ED NO:343, SEQ ED N0.344, SEQ ED N0.345, SEQ ED NO:346, SEQ ED NO:347, 
SEQ ED NO:348, and SEQ ED NO:349, encoded by SEQ ED NO:132, SEQ ED NO:133, SEQ ID 
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NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, and SEQ ID NO:138, respectively, are, 
for example, antigen recognition molecules. 

SEQ ID NO:350, SEQ ID NO:351, SEQ ID NO:352, and SEQ ID NO:353, encoded by SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, and SEQ ID NO:142, respectively, are, for example, 
5 electron transfer associated molecules. 

SEQ ID NO:354, SEQ ID NO:355, SEQ ID N0.356, SEQ ID NO:357, SEQ ID NO:358, and 
SEQ ID NO:359, encoded by SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO: 145, SEQ ID NO:146, 
SEQ ID N0.147, and SEQ ID NO: 148, respectively, are, for example, secreted/extracellular matrix 
molecules. 

10 SEQ ID NO:360, SEQ ID NO:361, SEQ ID NO:362, SEQ ID NO:363, SEQ ID NO:364, 

SEQ ID NO:365, SEQ ID NO:366. SEQ ID NO:367, SEQ ID NO:368, and SEQ ID NO:369, encoded 
by SEQ ID NO:149] SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID 
NO:154, SEQ ID NO:155, SEQ ID N0.156, SEQ ID N0.157, and SEQ ID NO:158, respectively, 
are, for example, cytoskeletal molecules. 

15 SEQ ID NO:370, SEQ ID NO:371, SEQ ID NO:372, and SEQ ID NO:373, encoded by SEQ 

ID NO:159, SEQ ID NO:160, SEQ ID NO:161, and SEQ ID NO:162, respectively, are, for example, 
cell membrane molecules. 

SEQ ID NO:374, SEQ ID NO:375, SEQ ID NO:376, SEQ ID NO:377, SEQ ID NO:378, 
SEQ ID NO:379, SEQ ID NO:380, SEQ ID NO:381> SEQ ID NO:382, SEQ ID NO:383, SEQ ID 

20 NO:384, SEQ ID NO:385, SEQ ID NO:386", SEQ ID NO:387, SEQ ID NO:388, SEQ ID NO:389. 
SEQ ID NO:390, SEQ ID NO:391, and SEQ ID N0.392, encoded by SEQ ID NO:163, SEQ ID 
NO: 164, SEQ ID NO:165, SEQ ID NO:l<56, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, 
SEQ ED NO:170, SEQ ID N0.171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID 
NO:175, SEQ IDNO:176, SEQ ED NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, 

25 and SEQ ID NO:181, respectively, are, for example, ribosomal molecules. 

SEQ ID NO:393, SEQ ID NO:394, SEQ ID NO:395, SEQ ID NO:396, SEQ ID NO:397, 
SEQ ID NO:398, SEQ ID N0.399, SEQ ID NO:400, SEQ ID NO:401, SEQ ED NO:402, and SEQ ED 
NO:403, encoded by SEQ ED NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ED 
NO: 186, SEQ ED NO:187, SEQ ID NO:188, SEQ ID N0.189, SEQ ED NO:190, SEQ ID N0.191, 

30 and SEQ ID NO: 192, respectively, are, for example, organelle associated molecules. 

SEQ ID NO:404, SEQ ED NO:405, SEQ ED NO:406, SEQ ID NO:407, SEQ ID NO:408, 
SEQ ED NO:409, SEQ ID NO:410, SEQ ID NO:41 1, SEQ ID NO:412, SEQ ED NO:413, and SEQ ED 
N0.414, encoded by SEQ ED NO:193, SEQ ID NO:194, SEQ DD NO:195, SEQ ID NO:196, SEQ ID 
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NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
and SEQ ID NO:203, respectively, are; for example, biochemical pathway molecules. 

SEQ ID NO:415, SEQ ID NO:416, SEQ ID NO:417, SEQ ID NO:418, SEQ ID NO:419, 
SEQ ID NO:420, SEQ ID NO:421, and SEQ ID NO:422, encoded by SEQ ID NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
and SEQ ID NO:21 1, respectively, are, for example, molecules associated with growth and 
development 

Sequences of Human Diagnostic and Therapeutic Molecules 

The dithp of the present invention may be used for a variety of diagnostic and therapeutic 
purposes. For example, a dithp may be used to diagnose a particular condition, disease, or disorder 
associated with human molecules. Such conditions, diseases, and disorders include, but are not limited 
to, a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
dirhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder, such as inflammation, actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, . 
allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal 
hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with 
lymphocytotoxins, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 
polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, trauma, and hematopoietic 

cancer including lymphoma, leukemia, and myeloma; an infection caused by a viral agent classified 

■* 
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as adenovirus, arenavirus, bunyavirus, calici virus, coronavirus, filovims, hepadnavirus, herpesvirus, 
flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, picornavirus, poxvirus, reovirus, 
retrovirus, rhabdovirus, or togavirus; an infection caused by abacterial agent classified as 
pneumococcus, staphylococcus, streptococcus, bacillus, corynebacterium, Clostridium, 
meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus, legionella, bordetella, gram- 
negative enterobacterium including shigella, salmonella, or Campylobacter, pseudomonas, vibrio, 
brucella, francisella, yersinia, bartonella, norcardium, actinomyces, mycobacterium, spirochaetale, 
rickettsia, chlamydia, or mycoplasma; an infection caused by a fungal agent classified as aspergillus, 
blastomyces, dermatophytes, cryptococcus, coccidioides, malasezzia, histoplasma, or other mycosis- 
causing fungal agent; and an infection caused by a parasite classified as Plasmodium or malaria- 
causing, parasitic entamoeba, leishmaoia, trypanosoma, toxoplasma, Pneumocystis carinii, intestinal 
protozoa such as giardia, trichomonas, tissue nematode such as Mchinella, intestinal nematode such 
as ascaris, lymphatic filarial nematode, trematode such as schistosoma, and cestrode such as 
tapeworm; a developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, 
achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, 
WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), 
Smith-Magenis syndrome, myelodysplastic syndrome, hereditary myoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing 
loss; an endocrine disorder such as a disorder of the hypothalamus and/or pituitary resulting from 
lesions such as a primary brain tumor, adenoma, infarction associated with pregnancy, 
hypophysectomy, aneurysm, vascular malformation, thrombosis, infection, immunological disorder, and 
complication due to head trauma; a disorder associated with hypopituitarism including hypogonadism, 
Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian disease, Letterer- 
Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism; a disorder associated with 
hypopituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma; a disorder associated with hypothyroidism 
including goiter, myxedema, acute thyroiditis associated with bacterial infection, subacute thyroiditis 
associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and cretinism; a disorder 
associated with hypothyroidism including thyrotoxicosis and its various forms, Grave's disease, 
pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's disease; a disorder 
associated with hyperparathyroidism including Conn disease (chronic hypercalemia); a pancreatic 
disorder such as Type I or Type n diabetes mellitus and associated complications; a disorder associated 
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with the adrenals such as hypoplasia, carcinoma, or adenoma of the adrenal cortex, hypotension 
associated with alkalosis, amyloidosis, hypokalemia, Cu string's disease, Liddle's syndrome, and 
Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and Addison's disease; a disorder 
associated with gonadal steroid hormones such as: in women, abnormal prolactin production, 
infertility, endometriosis, perturbation of the menstrual cycle, polycystic ovarian disease, 
hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, 
hirsutism and virilization, breast cancer, and, inpost-menopausal women, osteoporosis; and, in men, 
Leydig cell deficiency, male climacteric phase, and germinal ceil aplasia, a hypergonadal disorder 
associated with Leydig cell tumors, androgen resistance associated with absence of androgen receptors, 
syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder such as Addison's disease, 
cerebrotendinous xanthomatosis, congenital adrenal hypoplasia, coumarin resistance, cystic fibrosis, 
diabetes, fatty hepatocirriiosis. fnj ctose-1, 6- diphosphatase deficiency, galactosemia, goiter, 
glucagonoma, glycogen storage diseases, hereditary fructose intolerance, hyper adrenalism, 
hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, hypothyroidism, 
hypoglycemia, hypothyroidism, hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, 
lysosomal storage diseases, mannosidosis, neuraminidase deficiency, obesity, pentosuria 
phenylketonuria, pseudovitamin D-deficiency rickets; disorders of carbohydrate metabolism such as 
congenital type II dyserythropoietic anemia, diabetes, insulin-dependent diabetes mellitus, 
non-insulin-dependent diabetes mellitus, fructose- 1,6-diphosphatase deficiency, galactosemia, 
glucagonoma, hereditary fructose intolerance, hypoglycemia, mannosidosis, neuraminidase 
deficiency, obesity, galactose epimerase deficiency, glycogen storage diseases, lysosomal storage 
diseases, fructosuria, pentosuria, and inherited abnormalities of pyruvate metabolism; disorders of 
lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, 
carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, 
lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, 
metachromatic leukodystrophy, adrenoleukodystrophy, GMj gangliosidosis, and ceroid 
lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, 
lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid 
adrenal hypoplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, 
hypercholesterolemia with hypertriglyceridemia, primary hypoalphahpopxoteanemia, hypothyroidism, 
renal disease, liver disease, lecithin:cholesterol acyltransferase deficiency, cerebrotendinous 
xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoff s disease, 
hyperlipidemia, hyperlipemia, lipid myopathies, and obesity; and disorders of copper metabolism 
such as Menke's disease, Wilson's disease, andEhlers-Danlos syndrome type IX; a neurological 
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disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, 
bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative 
intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion 
diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, 
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorder of the central nervous system, cerebral palsy, a 
neuroskeletal disorder, an autonomic nervous system disorder, a cranial nerve disorder, a spinal cord 
disease, muscular dvstrophy and other neuromuscular disorder, a peripheral nervous system disorder, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathy, myasthenia 
gravis, periodic paralysis, a mental disorder including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a 
gastrointestinal disorder including ulcerative colitis, gastric and duodenal ulcers, cystinuria, 
dibasicaminoaciduria, hypercystinuria, lysinuria, hartnup disease, tryptophan malabsorption, 
methionine malabsorption, histidinuria, iminoglycinuria, dicaiboxylicaminoaciduria, cystinosis, renal 
glycosuria, hypouricemia, familial hypophophatemic rickets, congenital chloridonhea, distal renal 
tubular acidosis, Menkes' disease, Wilson's disease, lethal dianhea, juvenile pernicious anemia, 
folate malabsorption, adrenoleukodystrophy, hereditary myoglobinuria, and Zellweger syndrome; a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes meUitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 
nonnokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyanythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 
cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, and polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
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neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, (Hushing* s disease, and Addison's disease; and a connective tissue disorder 
such as osteogenesis imperfecta, Ehlers-Danlos syndrome, chondrodysplasias, Marfan syndrome, 
Alport syndrome, familial aortic aneurysm, achondroplasia, mucopolysaccharidoses, osteoporosis, 
osteopetrosis, Paget' s disease, rickets, osteomalacia, hyperparathyroidism, renal osteodystrophy, 
osteonecrosis, osteomyelitis, osteoma, osteoid osteoma, osteoblastoma, osteosarcoma, 
osteochondroma, chondroma, chondroblastoma, chondromyxoid fibroma, chondrosarcoma, fibrous 
cortical defect, nonossifying fibroma, fibrous dysplasia, fibrosarcoma, malignant fibrous 
histiocytoma, Ewing's sarcoma, primitive neuroectodermal tumor, giant cell tumor, osteoarthritis, 
rheumatoid arthritis, ankylosing spondylarthritis, Reiter's syndrome, psoriatic arthritis, enteropathy 
arthritis, infectious arthritis, gout, gouty arthritis, calcium pyrophosphate crystal deposition disease, 
ganglion, synovial cyst, villonodular synovitis, systemic sclerosis, Dupuytren's contracture, hepatic 
fibrosis, lupus erythematosus, mixed connective tissue disease, epidermolysis bullosa sinqplex, bullous 
congenital ichthyosiform erythroderma (epidermolytic hyperkeratosis), non-epidermolytic and 
epidermolytic palmpplamar keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and 
white sponge nevus. The dithp can be used to detect the presence of, or to quantify the amount of, a 
dithp-reJated polynucleotide in a sample. This information is then compared to information obtained 
from appropriate reference samples, and a diagnosis is established. Alternatively, a polynucleotide 
complementary to a given dithp can inhibit or inactivate a therapeutically relevant gene related to the 
dithp. 

Analysis of dithp Expression Patterns 

The expression of dithp may be routinely assessed by hybridization-based methods to 
determine, for example, the tissue-specificity, disease-specificity, or developmental stage-specificity of 
dithp expression. For example, the level of expression of dithp may be compared among different cell 
types or tissues, among diseased and normal cell types or tissues, among cell types or tissues at 
different developmental stages, or among cell types or tissues undergoing various treatments. This type 
of analysis is useful, for example, to assess the relative levels of dithp expression in fully or partially 
differentiated cells or tissues, to determine if changes in dithp expression levels are correlated with the 
development or progression of specific disease states, and to assess the response of a cell or tissue to a 
specific therapy, for example, in pharmacological or toxicological studies. Methods for the analysis of 
dithp expression are based on hybridization and amplification technologies and include membrane- 
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based procedures such as northern blot analysis, high-throughput procedures that utilize, for example, 
microarrays, and PCR-based procedures. 

Hybridization and Genetic Analysis 
5 The dithp, their fragments, or complementary sequences, may be used to identify the presence 

of and/or to determine the degree of similarity between two (or more) nucleic acid sequences . Hie dithp 
may be hybridized to naturally occurring or recombinant nucleic acid sequences under appropriately 
selected temperatures and salt concentrations. Hybridization with a probe based on the nucleic acid 
sequence of at least one of the dithp allows for the detection of nucleic acid sequences, including 

1 o genomic sequences, which are identical or related to the dithp of the Sequence Listing. Probes may be 

selected from non-conserved or unique regions of at least one of the polynucleotides of SEQ ID NO:l- 
21 1 and tested for their ability to identify or amplify the target nucleic acid sequence using standard 
protocols. 

Polynucleotide sequences that are capable of hybridizing, in particular, to those shown in SEQ 
is ID NO.1-21 1 and fragments thereof, can be identified using various conditions of stringency. (See, 
e.g„ Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) 
Methods Enzymol. 152:507-5 1 1 .) Hybridization conditions are discussed in "Definitions." 

A probe for use in Southern or northern hybridization may be derived from a fragment of a 
dithp sequence, or its complement, that is up to several hundred nucleotides in length and is either 
20 single-stranded or double-stranded Such probes may be hybridized in solution to biological materials 
such as plasmids, bacterial, yeast, or human artificial chromosomes, cleared or sectioned tissues, or to 
artificial substrates containing dithp. Microarrays are particularly suitable for identifying the presence 
of and detecting the level of expression for multiple genes of interest by examining gene expression 
correlated with, e.g., various stages of development, treatment with a drag or compound, or disease 

2 5 progression. An array analogous to a dot or slot blot may be used to arrange and link polynucleotides 

to the surface of a substrate using one or more of the following: mechanical (vacuum), chemical, 
. thermal, or UV bonding procedures. Such an array may contain any number of dithp and may be 
produced by hand or by using available devices, materials, and machines. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See, e.g. , 
30 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/25 11 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A et al. (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M. J. et al. (1997) U.S. Patent No. 5,605,662.) 

Probes may be labeled by either PCR or enzymatic techniques using a variety of commercially 
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available reporter molecules. For example, commercial kits are available for radioactive and 
chemttuminescent labeling (Amersham Pharmacia Biotech) and for alkaline phosphatase labelling (Life 
Technologies). Alternatively, dithp may be cloned into commercially available vectors for the 
production of RNA probes. Such probes may be transcribed in the presence of at least one labeled 
5 nucleotide (e.g., 32 P-ATP, Aroersham Pharmacia Biotech). 

Additionally the polynucleotides of SEQ ID NO: 1-211 or suitable fragments thereof can be 
used to isolate full length cDNA sequences utilizing hybridization and/or amplification procedures well 
known in the art, e.g., cDNA library screening, PCR amplification, etc. The molecular cloning of such 
full length cDN A sequences may employ the method of cDNA library screening with probes using the 

1 o hybridization, stringency, washing, and probing strategies described above and in AusubeJ, supra . 

Chapters 3, 5, and 6. These procedures may also be employed with genomic libraries to isolate 
genomic sequences of dithp in order to analyze, e.g., regulatoiy dcmcats. 

Genetic Mapping 

is Gene identification and mapping are important in the investigation and treatment of almost an 

conditions, diseases, and disorders. Cancer, cardiovascular disease, Alzheimer's disease, arthritis, 
diabetes, and mental illnesses are of particular interest Each of these conditions is more complex than 
the single gene defects of sickle cell anemia or cystic fibrosis, with select groups of genes being 
predictive of predisposition for a particular condition, disease, or disorder. For example, 

2 o cardiovascular disease may result from malfunctioning receptor molecules that fail to clear cholesterol 

from the bloodstream, and diabetes may result when a particular individual's immune system is 
activated by an infection and attacks the insulin-producing cells of the pancreas. In some studies, 
Alzheimer's disease has been linked to a gene on chromosome 21 ; other studies predict a different gene 
and location. Mapping of disease genes is a complex and reiterative process and generally proceeds 
25 from genetic linkage analysis to physical mapping. 

As a condition is noted among members of a family, a genetic linkage map traces parts of 
chromosomes that are inherited in the same pattern as the condition. Statistics link the inheritance of 
particular conditions to particular regions of chromosomes, as defined by RFLP or other markers. 
(See, for example, Lander, E. S. and Botstein, D. (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

3 o Occasionally, genetic markers and their locations are known from previous studies. More often, 

however, the markers are simply stretches of DNA that differ among individuals. Examples of genetic 
linkage maps can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. 

In another embodiment of the invention, dithp sequences may be used to generate hybridization 
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probes useful in chromosomal mapping of naturally occurring genomic sequences. Either coding or 
noncoding sequences of dithp may be used, and in some instances, noncoding sequences may be 
preferable ova: coding sequences. For example, conservation of a dithp coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 
5 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific region 
of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat 
Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 
10 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic man data, (See, e.g., Meyers, supra, pp. 965-968.) Correlation 
between the location of dithp on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that disorder. 

15 The dithp sequences may also be used to detect polymorphisms that are genetically linked to the 
inheritance of a particular condition, disease, or disorder. 

In situ hybridization of chromosomal preparations and genetic mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending existing genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 

2 o mouse, may reveal associated markers even if the number or arm of the corresponding human 

chromosome is not known. These new marker sequences can be mapped to human chromosomes and 
may provide valuable information to investigators searching for disease genes using positional cloning 
or other gene discovery techniques. Once a disease or syndrome has been crudely correlated by genetic 
linkage with a particular genomic region, e.g„ ataxia-telangiectasia to llq22-23, any sequences 

2 5 mapping to that area may represent associated or regulatory genes for further investigation. (See, e,g„ 

Gatti, R. A et al. (1988) Nature 336:577-580.) The nucleotide sequences of the subject invention may 
also be used to detect differences in chromosomal architecture due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

Once a disease-associated gene is mapped to a chromosomal region, the gene must be cloned in 

3 o order to identify mutations or other alterations (e.g., translocations or invasions) that may be correlated 

with disease. This process requires a physical map of the chromosomal region containing the disease- 
gene of interest along with associated markers. A physical map is necessary for determining the 
nucleotide sequence of and order of marker genes on a particular chromosomal region Physical 
mapping techniques are well known in the art and require the generation of overlapping sets of cloned 
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DNA fragments from a particular organelle, chromosome, or genome. These clones are analyzed to 
reconstruct and catalog their order. Once the position of a marker is determined, the DNA from that 
region is obtained by consulting the catalog and selecting clones from that region. Hie gene of interest 
is located through positional cloning techniques using hybridization or similar methods. 

Diagnostic Uses 

The dithp of the present invention may be used to design probes useful in diagnostic assays. 
Such assays, well known to those skilled in the art, may be used to detect or confirm conditions, 
disorders, or diseases associated with abnormal levels of dithp expression Labeled probes developed 
from dithp sequences are added to a sample under hybridizing conditions of desired stringency. In some 
instances, dithp, or fragments or oligonucleotides derived from dithp, may be used as primers in 
amplification steps prior to hybridization. Hie amount of hybridization complex funned is quantified 
and compared with standards for that cell or tissue. If dithp expression varies significantly from the 
standard, the assay indicates the presence of the condition, disorder, or disease. Qualitative or 
quantitative diagnostic methods may include northern, dot blot, or other membrane or dip-stick based 
technologies or multiple-sample format technologies such as PCR, enzyme-linked immunosorbent assay 
.(ELISA)-like, pin, or chip-based* assays. 

The probes described above may also be used to monitor the progress of conditions, disorders, 
or diseases associated with abnormal levels of dithp expression, or to evaluate the efficacy of a 
particular therapeutic treatment The candidate probe may be identified from the dithp that are specific 
to a given human tissue and have not been observed in GenBank or other genome databases. Such a 
. probe may be used in animal studies, preclinical tests, clinical trials, or in monitoring the treatment of 
an individual patient In a typical process, standard expression is established by methods well known in 
file art for use as a basis of comparison, samples from patients affected by the disorder or disease are 
combined with the probe to evaluate any deviation from the standard profile, and a therapeutic agent is 
administered and effects are monitored to generate a treatment profile. Efficacy is evaluated by 
determining whether the expression progresses toward or returns to the standard normal patten. 
Treatment profiles may be generated over a period of several days or several months. Statistical 
methods well known to those skilled in the art may be use to determine the significance of such 
therapeutic agents. 

Hie polynucleotides are also useful for identifying individuals from minute biological sauries, 
for example, by matching the RFLP pattern of a sample's DNA to that of an individual's DNA. The 
polynucleotides of the present invention can also be used to determine the actual base-by-base DNA 
sequence of selected portions of an individual's genome. These sequences can be used to prepare PCR 
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primers for amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, an individual can be identified through a unique set of DNA sequences. Once a unique ID 
database is established for an individual, positive identification of that individual can be made from 
extremely small tissue samples. 

In a particular aspect, oligonucleotide primers derived from the dithp of the invention may be 
used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of SNP 
detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from dithp are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in singie-siranded form, and these 
differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescenily labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequences of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 

of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 

v. 

chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

DNA-based identification techniques are critical in forensic technology. DNA sequences taken 
from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, 
semen, etc., can be amplified using, e.g., PCR, to identify individuals. (See, e.g., Erlich, H. (1992) 
PCR Technology . Freeman and Co., New York, NY). Similarly, polynucleotides of the present 
invention can be used as polymorphic markers. 

There is also a need for reagents capable of identifying the source of a particular tissue. 
Appropriate reagents can comprise, for example, DNA probes or primers prepared from the sequences 
of the present invention that are specific for particular tissues. Panels of such reagents can identify 
tissue by species and/or by organ type. In a similar fashion, these reagents can be used to screen tissue 
cultures for contamination 

The polynucleotides of the present invention can also be used as molecular weight markers on 
nucleic acid gels or Southern blots, as diagnostic probes for the presence of a specific roRNA in a 
particular cell type, in the creation of subtracted cDN A libraries which aid in the discovery of novel 
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polynucleotides, in selection and synthesis of oligomers for attachment to an array or other support, and 
as an antigen to elicit an immune response. 

Disease Model Systems Using dithp 
5 The dithp of the invention or their mammalian homologs may be "knocked out" in an animal 

model system using homologous recombination in embryonic stem (ES) cells. Such techniques are well 
known in the art and are useful for the generation of animal models of human disease. (See, e.g., U.S. 
Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For example, mouse ES cdls, such as 
the mouse 129/SvJ cell line, are derived from the early mouse embryo and grown in culture. Hie ES 

1 o cells are transformed with a vector containing the gene of interest disrupted by a marker gene, e.g„ the 

neomycin phosphotransferase gene (neo; Capecchi, MR, (1989) Science 244:1288-1292). The vector 
integrates into the corresponding region of the host genome by homologous recombination. 
Alternatively, homologous recombination takes place using the Cre-loxP system to knockout a gene of 
interest in a tissue- or developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest 97:1999- 

15 2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are 

identified and microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. 
The blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 

20 The dithp of the invention may also be manipulated in vitro in ES cells derived from human 

blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell lineages 
including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate into, for 
example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998) Science 
282:1145-1147). 

2 5 The dithp of the invention can also be used to create "knockin" humanized animals (pigs) or 

transgenic animals (mice or rats) to model human disease. With knockin technology, a region of dithp 
is injected into animal ES cells, and the injected sequence integrates into the animal cell genome. 
Transformed cells are injected into blastulae, and the blastulae are implanted as described above. 
Transgenic progeny or inbred lines are studied and treated with potential pharmaceutical agents to 

3 o obtain information on treatment of a human disease. Alternatively, a mammal inbred to overexpress 

dithp, resulting, e.g., in the secretion of DITHP in its milk, may also serve as a convenient source of 
that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

Screening Assays 
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DITHP encoded by polynucleotides of the present invention may be used to screen for 
molecules that bind to or are bound by the encoded polypeptides. The binding of the polypeptide and 
the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the 
polypeptide or the bound molecule. Examples of such molecules include antibodies, oligonucleotides, 
proteins (&g„ receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, &g., a ligand 
or fragment thereof, a natural substrate, or a structural or functional mimetic. (See, Coligan est al., 
(1991) Current Prot nmis m imm unology 1(2): Chapter 5.) Similarly, the molecule can be closely 
related to the natural receptor to which the polypeptide binds, or to at least a fragment of the receptor, 
e.g., the active site. In either case, the molecule can be rationally designed using known techniques. 
Preferably, the screening for these molecules involves producing appropriate cells which express the 
polypeptide, either as a secreted protein or on the cell membrane. Preferred cdls include ceils from 
mammals, yeast, Drosophila . or E. coli . Cells expressing the polypeptide or cell membrane fractions 
which contain the expressed polypeptide are thai contacted with a test compound and binding, 
stimulation, or inhibition of activity of either the polypeptide or the molecule is analyzed. 

An assay may simply test binding of a candidate compound to the polypeptide, wherein binding 
is detected by a fluorophore, radioisotope, enzyme conjugate* or other detectable label. Alternatively, 
the assay may assess binding in the presence of a labeled competitor. 

Additionally, the assay can be carried out using cell-free preparations, polypeptide/molecule 
affixed to a solid support, chemical libraries, or natural product mixtures. The assay may also, simply 

■■i 

comprise the steps of mixing a candidate compound with a solution containing a polypeptide* measuring 
polypeptide/molecule activity or binding, and comparing the polypeptide/molecule activity or binding to 
a standard. 

Preferably, an ELISA assay using, e>g., a monoclonal or polyclonal antibody, can measure 
polypeptide level in a sample. The antibody can measure polypeptide level by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of the above assays can be used in a diagnostic or prognostic contort. The molecules 
discovered using these assays can be used to treat disease or to bring about a particular result in a 
patient (e.g., blood vessel growth) by activating or inhibiting the polypeptide/molecule. Moreover, the 
assays can discover agents which may inhibit or enhance the production of the polypeptide from 
suitably manipulated cells or tissues. 

Transcript Imaging and Toxicological Testing 

Another embodiment relates to the use of dithp to develop a transcript image of a tissue or cell 
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type. A transcript image represents the global pattern of gene expression by a particular tissue or cell 
type. Global gene expression patterns are analyzed by quantifying the number of expressed genes and 
their relative abundance under given conditions and at a given time, (See SeOhamer et aL, 
"Comparative Gene Transcript Analysis," U.S. Patent Number 5,840,484, expressly incorporated by 
reference herein.) Thus a transcript image may be generated by hybridizing the polynucleotides of the 
present invention or their complements to the totality of transcripts or reverse transcripts of a particular 
tissue or cell type. In one embodiment, the hybridization takes place in high-throughput format, 
wherein the polynucleotides of the present invention or their complements comprise a subset of a 
plurality of elements on a microarray. The resultant transcript image would provide a profile of gene 
activity pertaining to human molecules for diagnostics and therapeutics. 

Transcript images which profile dithp expression may be generated using transcripts isolated 
from tissues, cell lines, biopsies, or other biological samples. The transcript image may thus reflect 
dithp expression in vivo, as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell 
line. 

Transcript images which profile dithp expression may also be used in conjunction with in vitro 
model systems and preclinical evaluation of pharmaceuticals, as well as toxicological testing of 
industrial and naturally-occurring environmental compounds. All compounds induce characteristic 
gene expression patterns, frequently termed molecular fingerprints or toxicant signatures, which are 
indicative of mechanisms of action and toxicity (Nuwaysir, E. F. et al. (1999) Mol. Carcinbg. 24:153- 
159; Steiner, S. and Anderson, NX. (2000) Toxicol. Lett. 112-1 13:467-71, expressly incorporated by 
reference herein). If a test compound has a signature similar to that of a compound with known 
toxicity, it is likely to share those toxic properties. These fingerprints or signatures are most useful and 
refined when they contain expression information from a large number of genes and gene families. 
Ideally, a genome-wide measurement of expression provides the highest quality signature. Even genes 
whose expression is not altered by any tested compounds are important as well, as the levels of 
expression of these genes are used to normalize the rest of the expression data. The normalization 
procedure is useful for comparison of expression data after treatment with different compounds. While 
the assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 
Institute of Environmental Health Sciences, released February 29, 2000, available at 
httpy/ww.niehs.nih.gov/o<VneWtoxchip.htm) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
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containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of DITHP encoded by polynucleotides of the 
present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 
global patten of protein expression in a particular tissue or cell type. Each protein component of a 
proteome can be subjected individually to further analysis, Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance undo* given 
conditions and at a given time* A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gd with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the proton in the sample. The optical densities of equivalency 
positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for DITHP to quantify the 
levels of DITHP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-11; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-88). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
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be analyzed in parallel with toxicant signatures at the transcript level. TCiere is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and 
Seflhamer, J. (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
in the amount of protean between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino add residues of the 
individual proteins and comparing these partial sequences to the DITHP encoded by polynucleotides of 
the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the DITHP encoded by polynucleotides of the present invention. The amount 
of protein recognized by the antibodies is quantified The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Transcript images may be used to profile dithp expression in distinct tissue types. This process 
can be used to determine human molecule activity in a particular tissue type relative to this activity in a 
different tissue type. Transcript images may be used to generate a profile of dithp expression 
characteristic of diseased tissue. Transcript images of tissues before and after treatment may be used 
for diagnostic purposes, to monitor the progression of disease, and to monitor the efficacy of drug 
treatments for diseases which affect the activity of human molecules. 

Transcript images of cell lines can be used to assess human molecule activity and/or to identify 
cell lines that lack or misregulate this activity. Such ceil lines may then be treated with pharmaceutical 
agents, and a transcript image following treatment may indicate the efficacy of these agents in restoring 
desired levels of this activity. A similar approach may be used to assess the toxicity of pharmaceutical 
agents as reflected by undesirable changes in human molecule activity. Candidate pharmaceutical 
agents may be evaluated by comparing their associated transcript images with those of pharmaceutical 
agents of known effectiveness. 
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Antisense Molecules 

Hie polynucleotides of the present invention are useful in antisense technology. Antisense 
technology or therapy relies on the modulation of expression of a target protein through the specific 
binding of an antisense sequence to a target sequence encoding the target protein or directing its 
expression. (See, e.g., Agrawal, S., edL (1996) Antisense Therapeutics, Humana Press Inc., Totawa 
NJ; Alama, A. et al. (1997) Pharmacol. Res. 36(3):17M78; Crooke, S.T. (1997) Adv. Pharmacol. 
40:1-49; Sharma, H.W. andR. Narayanan (1995) Bioessays 17(12):1055-1063; andLavrosky, Y. et 
al. (1997) Biochem. Mol. Med 62(1):1 1-22.) An antisense sequence is a polynucleotide sequence 
capable of specifically hybridizing to at least a portion of the target sequence. Antisense sequences 
bind to cellular mRNA and/or genomic DNA, affecting translation and/or transcription. Antisense 
sequences can be DNA, RNA, or nucleic acid mimics and analogs. (See, e.g., Rossi, J.J. et al. (1991) 
Antisense Res. Dev. l(3):285-288; Lee, R. et al. (1998) Biochemistry 37(3):900-1010; Pardridge, 
W.M. et al. (1995) Proc. Natl. Acad. Sci. USA 92(12):5592-5596; and Nielsen, P. E. and Haaima, G. 
(1997) Chem. Soc. Rev. 96:73-78.) Typically, the binding which results in modulation of expression 
occurs through hybridization or binding of complementary base pairs. Antisense sequences can also 
bind to DNA duplexes through specific interactions in the major groove of the double helix. 

The polynucleotides of the present invention and fragments thereof can be used as antisense 
sequences to modify the expression of the polypeptide encoded by dithp. The antisense sequences can 
be produced ex vivo , such as by using any of the ABI nucleic acid synthesizer series (Applied 
Biosystems) or other automated systems known in the art. Antisense sequences can also be produced 
biologically, such as by transforming an appropriate host cell with an expression vector containing the 
sequence of interest. (See, e.g., Agrawal, supra .) 

In therapeutic use> any gene delivery system suitable for introduction of the antisense sequences 
into appropriate target cells can be used Antisense sequences can be delivered intracdlularly in the 
form of an expression plasmid which, upon transcription, produces a sequence complementary to at 
least a portion of the cellular sequence encoding the target protein. (See, eg., Slate, J.E., et al. (1998) 
J. Allergy Clin Immunol. 102(3):469-475; and Scanlon, KJ., et al. (1995) 9(13):1288-1296.) 
Antisense sequences can also be introduced intracdlularly through the use of viral vectors, such as 
retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 76:271; Ausubel, 
F.M. et al. (1995) Current Protocols in Molecular Biology . John Wiley & Sons, New York NY; Uckert, 
W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in the art. (See, e.g., 
Rossi, J.J. (1995) Br. Med Bull. 51(l):217-225; Boado, RJ. et al. (1998) J. Phann Sci. 87(11):1308- 
1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 25(14):2730-2736.) 
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Expression 

la order to express a biologically active DITTO, the nucleotide sequences encoding DITHP or 
fragments thereof m$y be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary dements for transcriptional and translational control of the inserted coding sequence in a 
5 suitable host. Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding DITHP and appropriate transcriptional and 
translational control dements. These methods include in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra . Chapters 4, 8, 16, and 17; 
and Ausubel, supra , Chapters 9, 10, 13, and 16.) 

10 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding DITHP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 

15 tobacco mosaic virus, TMV) or with bacterial expression vectors (e,g., Ti or pBR322 plasmids); or 
animal (mammalian) cell systems. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra . Van Heeke, 0. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A et al. (1987) Methods Enzymol. 
153:516-544; Scorer, C.A et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) 
Proa Natl Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher; 7:1937-1945; 

20 Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, 
R. et al. (1984) Science 224:838-843; Winter, I et al. (1991) Results Probl. Cell Differ. 17:85-105; 
Hie McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 
J.J. et al. (1997) Nat Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, 

25 or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue* or cell population. (See, e.g., Di Nicola, M. et al. (1998) 
Cancer Gen. Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad ScL USA90(13):6340-6344; 
Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 
31(3):219-226; and Verma, LM. and N. Somia (1997) Nature 389:239-242.) The invention is not 

3 o limited by the host cell employed. 

For long term production of recombinant proteins in mammalian systems, stable expression of 
DITHP in cell lines is preferred For example, sequences encoding DITHP can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Any number of 
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selection systems may be used to recover transformed cell lines. (See, e.g., Wigler, M. et aL (1977) 
Cdl 11:223-232; Lowy, L et al. (1980) Cell 22:817-823.; Wigler, M. et aL (1980) Proc. Natl. Acad. 
Sci. USA 77:3567-3570; Colbere-Garapin, F. et aL (1981) J. Mol. Biol. 150:1-14; Hartman, S.C. and 
R.CMuIligan(1988) Proc. NatL Acad. Sci. USA 85:8047-8051; Rhodes, C.A. (1995) Methods Mol. 
Biol. 55:121-131.) 

Therapeutic Uses of dithp 

The dithp of the invention may be used for somatic or genhline gene therapy. Gene therapy 
may be performed to (i) correct a genetic deficiency (ag., in the cases of severe combined 
immunodeficiency (SCID)-Xl disease characterized by X-linked inheritance (Cavazzana-Calvo, M. et 
al (2000) Science 288:669-672), severe combined immunodeficiency syndrome associated with an 
inherited adeno^^ (1995) Science 270:475-480; 

Bordignon, C. et al. (1995) Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207- 
216; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:667-703), thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor 
Vm or Factor IX deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, LM. and Somia, N. 
(1997) Nature 389:239-242)), (ii) express a conditionally lethal gene product (ag., in the case of 
cancers which result from unregulated cell proliferation), or (iii) express a protein which affords • 
protection against intracellular parasites (e.g., against human retroviruses, such as human 
immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 335:395-396; PoescWa, E. et al. (1996) 
Proc. NatL Acad. Sci. USA. 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, 
such as Candida albicans and Paracoccidioides brasiliensis ; and protozoan parasites such as 
Plasmodium falciparum and Trypanosoma cruzi) . In the case where a genetic deficiency in dithp 
expression or regulation causes disease, the expression of dithp from an appropriate population of 
transduced cells may alleviate the clinical manifestations caused by the genetic deficiency. 

In a further^mbodiment of the invention, diseases or disorders caused by deficiencies in dithp 
are treated by constructing mammalian expression vectors comprising dithp and introducing these 
vectors by mechanical means into dithp-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual ceils, (ii) ballistic gold 
particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNAtransposons (Morgan, R.A. and Anderson, W.F. (1993) Annu. Rev. Biochem. 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and R&ipon, H. (1998) Cuit. Opia BiotechnoL 9:445- 
450). 

Expression vectors that may be effective for the expression of dithp include, but are not limited 
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to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jofla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). The dithp of the invention 
may be expressed using (i) a constitutive^ active promoter, (e»g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter 
(e.g., the tetracycline-regulated promoter (Gossen, M. and Bujard, H. (1992) Proc. Natl. Acad Sci. 
U.S. A 89:5547-5551; Gossen, M. et al., (1995) Science 268:1766-1769; Rossi, F.M.V. and Blau, 
H.M. (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid 
(Invitrogen); the ecdysone-inducible promote (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promote; or the RU486/mifepristone inducible promote 
(Rossi, F.M. V. and Blau, H.M. supra) , or (iii) a tissue-specific promote or the native promote of the 
endogenous gene encoding DITHP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate m^hod 
(Graham, F.L. and Eb, A J. (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982)EMBOJ. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to dithp expression are treated by constructing a retrovirus vector consisting of (i) dithp under 
the control of an independent promote or the retrovirus long terminal repeat (LTR) promote, (ii) 
appropriate RNA packaging signals, and (iii) a Rev-responsive dement (RRE) along with additional 
retrovirus m-acting RNA sequences and coding sequences required for efficient vector propagatioa 
Retrovirus vectors (e.g., PFB and PFBNEO) are commercially available (Stratagene) and are based on 
published data (Riviere, L et al. (1995) Proc. Nad. Acad. Sci. U.S. A 92:6733-6737), incorporated by 
reference herein. The vector is propagated in an appropriate vector producing cell line (VPCL) that 
expresses an envelope gene with a tropism for receptors on the target cells or a promiscuous envelope 
proton such as VSVg (Armentano, D. et al. (1987) J. Virol. 61:1647-1650; Bender, M.A et al. (1987) 
J. Virol. 61:1639-1646; Adam, M.A and Miller, AD. (1988) J. ViroL 62:3802-3806; Dull, T. et al. 
(1998) J. Virol. 72:8463-8471; Zufferey, R. et al (1998) J. Virol. 72:9873-9880). U.S. Patent Number 
5,910,434 to Rigg ("Method for obtaining retrovirus packaging cell lines producing high transducing 
efficiency retroviral supernatant") discloses a method for obtaining retrovirus packaging cell lines and 
is hereby incorporated by reference. Propagation of retrovirus vectors, transduction of a population of 
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ceDs (e.g. , CD4 + T-cells), and the return of transduced cells to a patient are procedures well known to 
persons skilled in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. 
Virol 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 
71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sd. U.S.A. 95:1201-1206; Su, L. (1997) 
5 Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver dithp to 
cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
construction and packaging of adenovirus-based vectors are well known to those with ordinary skill in 
the art. Replication defective adenovirus vectors have proven to be versatile for importing genes 

1 o encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. et al. (1 995) 

Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. Patent 
Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated by 
reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Anna. Rev. Nutr. 19:511-544 
and Verma, I.M. and Somia, N. (1997) Nature 18:389:239-242, both incorporated by reference herein. 
15 In another alternative, a herpes-based, gene therapy delivery system is used to deliver dithp to 

target cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
use of herpes simplex virus (HS V)-based vectors may be especially valuable for introducing dithp to 
cells of the central nervous system, for which HSV has a tropism. The construction and packaging of 
hopes-based vectors are well known to those with ordinary skill in the art. A repUcation-competent 

2 o hopes simplex virus (HSV) type 1 -based vector has been used to deliver a reporter gene to the eyes of 

primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The construction of a HSV-1 virus vector 
has also been disclosed in detail in U.S. Patent Number 5,804,413 to DeLuca ("Hopes simplex virus 
strains for gene transfer"), which is hereby incorporated by reference. U.S. Patent Number 5,804,413 
teaches the use of recombinant HSV d92 which consists of a genome containing at least one exogenous 

2 5 gene to be transferred to a cell unda: the control of the appropriate promoter for purposes including 

human gene therapy. Also taught by this patent are the construction and use of recombinant HSV 
strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Goins, W. F. et al. 1999 J. 
Virol. 73:519-532 and Xu, H. et al., (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. 
The manipulation of cloned herpesvirus sequences, the generation of recombinant virus following the 

3 o transfection of multiple plasmids containing different segments of the large herpesvirus genomes, the 

growth and propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well 
known to those of ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver dithp to target cells. The biology of the prototypic alphavirus, Semliki Forest Virus (SFV), has 
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been studied extensively and gene transfer vectors have been based on the SFV genome (Garoff, H. and 
Li, K-J. (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus RNA replication, a subgenomic 
RNA is generated that normally encodes the viral capsid proteins. This subgenomic RNA replicates to 
higher levels than the Ml-length genomic RNA, resulting in the overproduction of capsid proteins 
relative to the viral proteins with enzymatic activity (e.g., protease and polymerase). Similarly, 
inserting dithp into the alphavirus genome in place of the capsid-coding region results in the production 
of a large number of dithp RNAs and the synthesis of high levels of DITHP in vector transduced cells. 
While alphavirus infection is typically associated with cell lysis within a few days, the ability to 
establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus 
(SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene 
therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of dithp into a variety of ceil types. The specific transduction 
of a subset of cells in a population may require the sorting of cells prior to transduction. The methods 
of manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are well known to those with ordinary skin in the 
art. 

Antibodies 

Anti-DITHP antibodies may be used to analyze protein expression levels. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, and Fab fragments. For 
descriptions of and protocols of antibody technologies; see, eg., Pound J.D. (1998) Immunochemical 
Protocols. Humana Press, Totowa, NJ. 

The amino acid sequence encoded by the dithp of. the Sequence Listing may be analyzed by 
appropriate software (e.g., LASERGENE NAVIGATOR software, DNASTAR) to determine regions 
of high immunogenicity. Hie optimal sequences for immunization are selected from the C-terminus, the 
N-terminus, and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed 
to the ©eternal environment when the polypeptide is in its natural conformatioa Analysis used to select 
appropriate epitopes is also described by Ausubel (1997, supra , Chapter 1 1 .7). Peptides used for 
antibody induction do not need to have biological activity; however, they must be antigenic. Peptides 
used to induce specific antibodies may have an amino acid sequence consisting of at least five amino 
acids, preferably at least 10 amino acids, and most preferably at least 15 amino acids. A peptide which 
mimics an antigenic fragment of the natural polypeptide may be fused with another protein such as 
keyhole limpet hemocyanin (KLH; Sigma, St. Louis MO) for antibody production. A peptide 
encompassing an antigenic region may be expressed from a dithp, synthesized as described above, or 
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purified from human cells. 

Procedures well known in the art may be used for the production of antibodies. Various hosts 
including mice, goats, and rabbits, may be immunized by injection with a peptide. Depending on the 
host species, various adjuvants may be used to increase immunological response. 

In one procedure, peptides about 15 residues in length may be synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using finoc-chemistry and coupled to KLH (Sigma) by 
reaction with M-maleimidobCTZoyl-N-hydroxysucdnimide ester (Ansubd, 1995, supra) . Rabbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant The resulting antisera are 
tested for antipeptide activity by binding the peptide to plastic, blocking with 1 % bovine serum albumin 
(BS A), reacting with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. 
Antisera with antipeptide activity are tested for anti-DITHP activity using protocols well known in the 
art, including BLISA, radioimmunoassay (RIA), andimn}nnoulotLing. 

In another procedure, isolated and purified peptide may be used to immunize mice (about 100 
ug of peptide) or rabbits (about 1 mg of peptide). Subsequently, the peptide is radioiodinated and used 
to screen the immunized animals* B-lymphocytes for production of antipeptide antibodies. Positive 
cells are then used to produce hybridomas using standard techniques. About 20 mg of peptide is 
sufficient for labeling and screening several thousand clones. Hybridomas of interest are detected by 
screening with radioiodinated peptide to identify those fusions producing peptide-specific monoclonal 
antibody. In a typical protocol, wells of a multi-well plate (FAST, Becton-Dickinson, Palo Alto, C A) 
are coated with affinity-purified, specific rabbit-anti-mouse (or suitable anti-species IgG) antibodies at 
10 mg/ml. The coated wells are blocked with 1% BSA and washed and exposed to supematants from 
hybridomas. After incubation, the wells are exposed to radiolabeled peptide at 1 mg/ml. 

Clones producing antibodies bind a quantity of labeled peptide that is detectable above 
background. Such clones are expanded and subjected to 2 cycles of cloning. Cloned hybridomas are 
injected into pristane-treated mice to produce ascites, and monoclonal antibody is purified from the 
ascitic fluid by affinity chromatography on protein A (Amersham Pharmacia Biotech). Several 
procedures for the production of monoclonal antibodies, including in vitro production, are described in 
Pound (supra) . Monoclonal antibodies with antipeptide activity are tested for anti-DITHP activity 
using protocols well known in the art, including ELIS A RIA, and immunoblotting. 

Antibody fragments containing specific binding sites for an epitope may also be generated For 
example, such fragments include, but are not limited to, the F(ab*)2 fragments produced by pepsin 
digestion of the antibody molecule, and the Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, construction of Fab expression libraries in filamentous 
bacteriophage allows rapid and easy identification of monoclonal fragments with desired specificity 
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(Pound, supra . Chaps. 45-47). Antibodies generated against polypeptide encoded by dithp can be used 
to purify and characterize full-length DITHP protean and its activity, binding partners, etc. 

Assays Using Antibodies 

5 Anti-DITHP antibodies may be used in assays to quantify the amount of DITHP found in a 

particular human cell. Such assays include methods utilizing the antibody and a label to detect 
expression level under normal or disease conditions. The peptides and antibodies of the invention may 
be used with or without modification or labeled by joining them, either covalently or noncovalently , 
with a reporter molecule. 

l o Protocols for detecting and measuring protein expression using either polyclonal or monoclonal 

antibodies are well known in the art. Examples include ELIS A, RIA, and fluorescent activated cell 
sorting (FACS). Such immunoassays typically involve iile formation of complexes between the DITHP 
and its specific antibody and the measurement of such complexes. These and other assays are described 
in Pound (supra) . 

15 Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of 
the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 

20 including U.S. Ser. No. 60/184,777, U.S. Ser. No. 60/184,797, U.S. Ser. No. 60/184,698, U.S. Ser. 
No. 60/184,770, U.S. Ser. No. 60/184,774, U.S. Ser. No. 60/184,693, U.S. Ser. No. 60/184,771,U.S. 
Ser. No. 60/184,813, U.S. Ser. No. 60/184,773, U.S. Ser. No. 60/184,776, U.S. Ser. No. 60/184,769, 
U.S. Ser. No. 60/184,768, U.S. Ser. No. 60/184,837, U.S. Ser. No. 60/184,697, U.S. Ser. No. 
60/184,841, U.S. So*. No. 60/184,772, U.S. Ser. No. 60/185,213, U.S. Ser. No. 60/185,216, U.S. Ser. 

25 No. 60/204,863, U.S. Ser. No. 60/205,221, U.S. Ser. No. 60/204,815, U.S. Ser. No. 60/203,785, U.S. 
Ser. No. 60/204,821, U.S. Ser. No. 60/204,908, U.S. Ser. No. 60/204,226, U.S. Ser. No. 60/204,525, 
U.S. Ser. No. 60/205,285, U.S. Ser. No. 60/205,232, U.S. Ser. No. 60/205,323, U.S. Ser. No. 
60/205,287, U.S. Ser. No. 60/205,324, and U.S. Ser. No. 60/205,286, are hereby expressly 
incorporated by reference. 

30 

EXAMPLES 

I. Construction of cDN A Libraries 

RNA was purchased from CLONTECH Laboratories, Inc. (Palo Alto CA) or isolated from 
various tissues. Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others 
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were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. Hie resulting lysates 
were centrifuged ova: CsCl cushions or extracted with chloroform. RNA was precipitated with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In most cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega Corporation (Promega), Madison WI), 
OLIGOTEX latex particles (QIAGEN, Inc. (QIAGEN), Valencia CA), or an OLIGOTEX mRNA 
purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other 
RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Inc., Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDN A libraries ware constructed with the UN1ZAP 
vector system (Stratagene Cloning Systems, Inc. (Stratagene), La Jolla CA) or SUPERSCRIPT 
plasmid system (Life Technologies), using the recommended procedures or similar methods known in 
the art (See, e.g., Ausubei, 1997, supra. Chapters 5.1 through 6.6.) Reverse transcription was 
initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double 
stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or enzymes': For 
most libraries, the cDNA was siz^selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE 
CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or 
preparative agarose t gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of 
thepolylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid 
(Life Technologies), PCDNA2.1 plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), 
or pINC Y (Incyte Genomics, Palo Alto CA), or derivatives thereof. Recombinant plasmids were 
transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5ct, DH10B, or ElectroMAX DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: the Magic or 
WIZARD Miniprqjs DNA purification system (Promega); the AGTC Miniprep purification kit (Edge 
BioSystems, Gaithersburg MD); and the QIAWELL 8, QIAWELL 8 Plus, and QIAWELL 8 Ultra 
plasmid purification systems or the RE. AX. PREP 96 plasmid purification kit (QIAGEN). Following 
precipitation, plasmids ware resuspended in 0. 1 ml of distilled water and stored, with or without 
lyophilization,at4 0 C. 
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Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
Mghrthroughput format (Rao, V.B. (1994) Anal Biochem 216:1-14.) Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples ware processed and stored in 384- 
wdl plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGKEEN dye (Molecular Probes, Inc. (Molecular Probes), Eugene OR) and a FLUOROSKAN n 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

cDN A sequencing reactions were processed using standard methods or high-throughput 
instrumentation such as the ABI CATALYST 800 thermal cycler (Applied Biosystems) or the PTC- 
200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific 
Corp., Sunnyvale CA) or the MICROLAB 22G0 liquid transfer Systran (Hamilton). cDN A sequencing 
reactions ware prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI 
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit 
(Applied Biosystems). Electrophoretic separation of cDNA sequencing reactions and detection of 
labeled polynucleotides were carried out using the MEGAB ACE 1000 DNA sequencing system 
(Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in 
conjunction with standard ABI protocols and base calling software; or otto sequence analysis systems 
known in the art. Reading frames within the cDNA sequences ware identified using standard methods 
(reviewed in Ausubei, 1997, supra. Chapter 7.7). Some of the cDNA sequences were selected for 
extension using the techniques disclosed in Example VIII. 

IV. Assembly and Analysis of Sequences 

Component sequences from chromatograms were subject to PHRED analysis and assigned a 
quality score. The sequences having at least a required quality score wore subject to various pro- 
processing editing pathways to eliminate, e.g., low quality 3' ends, vector and linker sequences, polyA 
tails, Alu repeats, mitochondrial and ribosomal sequences, bacterial contamination sequences, and - 
sequences smaller than 50 base pairs. In particular, low-information sequences and repetitive elements 
(e.g., dinucleotide repeats, Alu repeats, etc.) were replaced by "n's", or masked, to prevent spurious 
matches. 

Processed sequences were thai subject to assembly procedures in which the sequences were 
assigned to gene bins (bins). Each sequence could only belong to one bia Sequences in each gene bin 
were assembled to produce consensus sequences (templates). Subsequent new sequences were added to 
existing bins using BLASTn (v. 1 .4 WashU) and CROSSMATCH. Candidate pairs were identified as 
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all BLAST hits having a quality score greater than or equal to 150. Alignments of at least 82% local 
identity were accepted into the bin. The component sequences from each bin were assembled using a 
version of PHRAP. Bins with several overlapping component sequences were assembled using DEEP 
PHRAP. The orientation (sense or antisense) of each assembled template was determined based on the 
number and orientation of its component sequences. Template sequences as disclosed in the sequence 
listing correspond to sense strand sequences (the 'forward" reading frames), to the best determinatioa 
The complementary (antisense) strands are inherently disclosed herein. The component sequences 
which ware used to assemble each template consensus sequence are listed in Table 4, along with their 
positions along the template nucleotide sequences. 

Bins were compared against each other and those having local similarity of at least 82% were 
combined and reassembled Reassembled bins having templates of insufficient ova-lap (less than 95% 
local identity) ware re-split Ass embied templates were also subject to analysis by ST1TCHER/EXON 
MAPPER algorithms which analyze the probabilities of the presence of splice variants, alternatively 
spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or 
disease states, etc. These resulting bins were subject to several rounds of the above assembly 
procedures. 

Once gene bins were generated based upon sequence alignments, bins were clone joined based 
upon clone information. If the 5' sequence of one clone was presort in one bin and the 3' sequence from 
the same clone was present in a different bin, it was likely that the two bins actually belonged together 
in a single bin. The resulting combined bins underwent assembly procedures to regenerate the 
consensus sequences. 

The final assembled templates were subsequently annotated using the following procedure. 
Template sequences were analyzed using BLASTn (v2.0, NCBI) versus gbpri (GenBank version 120). 
"Hits" were defined as an exact match having from 95% local identity over 200 base pairs through 
1 00% local identity ova* 1 00 base pairs, or a homolog match having an E-value, i.e. a probability 
score, of <; 1 x 10"*. The hits were subject to frameshift FASTx versus GENPEPT (GenBank version 
120). (See Table 7). In this analysis, a homolog match was defined as having an E-value of * 1 x 10* 8 . 
The assembly method used above was described in "System and Methods for Analyzing Biomolecular 
Sequences/' U.S.S.N. 09/276,534, filed March 25, 1999, and the LIFESEQ Gold user manual (Incyte) 
both incorporated by reference hereaa 

Following assembly, template sequences were subjected to motif, BLAST, and functional 
analyses, and categorized in protein hierarchies using methods described in, e.g., "Database System 
Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," U.S.S.N. 
08/812,290, filed March 6, 1997; "Relational Database for Storing Biomolecule Information," 
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U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/811,758, filed March 6, 1997; and 'Relational Database and System for 
Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein. 

The template sequences were further analyzed by translating each template in all three forward 
reading frames and searching each translation against the Pfam database of hidden Markov model- 
based protean families and domains using the HMMER software package (available to the public from 
Washington University School of Medicine, St. Louis MO). Regions of templates which, when 
translated, contain similarity to Pfam consensus sequences are reported in Table 2, along with 
descriptions of Pfam protein domains and families. Only those Pfam hits with an E-value of s 1 x 10" 3 
are reported (See also World Wide Web site http-J/pfara wustl.edu/ for detailed descriptions of Pfam 
protein domains and families.) 

Additionally, the template sequences were translated in all three forward reading frames, and 
each translation was searched against hidden Markov models for signal peptides using the HMMER 
software package. Construction of hidden Markov models and their usage in sequence analysis has 
been described (See, for example, Eddy, S.R. (1996) Curr. Opia Str. Biol. 6:361-365.) Only those 
signal peptide hits with a cutoff score of 1 1 bits or greater are reported A cutoff score of 1 1 bits or 
greater corresponds to at least about 91-94% true-positives in signal peptide prediction. Template 
sequences were also translated in all three forward reading frames, and each translation was searched 
against TMAP, a program that uses weight matrices to delineate transmembrane segments on protein 
sequences and determine orientation, with respect to the cell cytosol (Persson, B. and P. Argos (1994) J. 
Mol. Biol. 237:182492; Persson, B. and P. Argos (1996) Protein Sci. 5:363-371). Regions of 
templates which, when translated, contain similarity to signal peptide or transmembrane consensus 
sequences are reported in Table 3. 

The results of HMMER analysis as reported in Tables 2 and 3 may support the results of 
BLAST analysis as reported in Table 1 or may suggest alternative or additional properties of template- 
encoded polypeptides not previously uncovered by BLAST or other analyses. 

Template sequences are further analyzed using the bioinformatics tools listed in Table 7, or 
using sequence analysis software known in the art such as MACDNASIS PRO software (Hitachi 
Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Template 
sequences may be further queried against public databases such as the GenBank rodent, mammalian, 
vertebrate, prokaryote, and eukaryote databases. 

The template sequences were translated to derive the corresponding longest open reading frame 
as presented by the polypeptide sequences. Alternatively, a polypeptide of the invention may begin at 
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any of the methionine residues within the full length translated polypeptide. Polypeptide sequences 
were subsequently analyzed by querying against the GenBank protein database (GENPEPT, (GenBank 
version 121)). Foil length polynucleotide sequences are also analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments are generated using default 
parameters specified by the CLUSTAL algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 6 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (GENPEPT) database. Column 1 shows the polypeptide 
sequence identification number (SEQ ID NO:) for the polypeptide segments of the invention. Column 2 
shows the reading frame used in the translation of the polynucleotide sequences encoding the 
polypeptide segments. Column 3 shows the length of the translated polypeptide segments. Columns 4 
and 5 show the start and stop nucleotide positions of the polynucleotide sequences encoding the 
polypeptide segments. Column 6 shows the GenBank identification number (GI Number) of the nearest 
GenBank homolog. Column 7 shows the probability score for the match between each polypeptide and 
its GenBank homolog. Column 8 shows the annotation of the GenBank homolog. 
V. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound (See, e,g„ Sambrook, supra, ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or r dated 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

Hie product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
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assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If tee is more 
than one HSP, thai the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% ova-lap. 

VI. Tissue Distribution Profiling 

A tissue distribution profile is determined for each template by compiling the cDNA library 
tissue classifications of its component cDNA sequences. Each component sequence, is derived from a 
cDNA library constructed from a human tissue. Each human tissue is classified into one of the 
following categories: cardiovascular system; connective tissue; digestive system; emhryonic structures; 
endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; hemic and immune 
system; liver; musculoskeletal system; nervous system; pancreas; respiratory system; sense organs; 
skin; stomatognathic system; unclassified/mixed; or urinary tract. Template sequences, component 
sequences, and cDNA library/tissue information are found in the LIFESEQ GOLD database (Incyte 
Genomics, Palo Alto CA). 

Table 5 shows the tissue distribution profile for the templates of the invention. For each 
template, the three most frequently observed tissue categories are shown in column 3, along with the 
percentage of component sequences belonging to each category. Only tissue categories with 
percentage values of * 10% are shown. A tissue distribution of "widely distributed" in column 3 
indicates percentage values of <10% in all tissue categories. 

VTL Transcript Image Analysis 

Transcript images are generated as described in Seilhamer et al., "Comparative Gene 
Transcript Analysis," U.S. Patent Number 5,840,484, incorporated herein by reference. 

VHI. Extension of Polynucleotide Sequences and Isolation of a Full-length cDNA 

Oligonucleotide primers designed using a dithp of the Sequence Listing are used to extend the 
nucleic acid sequence. One primer is synthesized to initiate 5' extension of the template, and the other 
prima, to initiate 3' extension of the template. The initial primers may be designed using OLIGO 4.06 
software (National Biosciences, Inc. (National Biosciences), Plymouth MN), or another appropriate 
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program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to 
anneal to the target sequence at temperatures of about 68 °C to about 72°C. Any stretch of nucleotides 
which would result in hairpin structures and primer-primer dimerizations are avoided Selected human 
cDNA libraries are used to extend the sequence. If more than one extension is necessary or desired, 
5 additional or nested sets of primers are designed 

High fidelity amplification is obtained by PCR using methods well known in the art PCR is 
performed in 96-weH plates using the PTC-200 thermal cycler (MJ Research). The reaction mix 
contains DNA template^ 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , and B- 
mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life 

1 o Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for prima- pair 

PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps %, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ are as follows: Step 1 : 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and4 repeated 20 times; 
15 Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well is determined by dispensing 100 \il PICOGREEN 
quantitation reagent (0.25% (v/v); Molecular Probes) dissolved in IX Tris-EDTA (TE) and 0.5 ^1 of 
undiluted PCR product into each well of an opaque fluorimeter plate (Corning Incorporated (Corning), 
Corning NY), allowing the DNA to bind to the reagent The plate is scanned in a FLUOROSKAN II 

2 o (Labsystems Oy) to measure the fluorescence of the sample and to quantify the concentration of DNA. 

A 5 \xl to 10 \il aliquot of the reaction mixture is analyzed by electrophoresis on a 1 % agarose mini-gel 
to determine which reactions are successful in extending the sequence* 

The extended nucleotides are desalted and concentrated, transferred to 384-well plates, digested 
with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or 

2 5 sheared prior to rdigation into pUC 1 8 vector (Amersham Pharmacia Biotech). For shotgun 

sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments are excised, and agar digested with AGAR ACE (Promega). Extended clones are religated 
using T4 ligase (New England Biolabs, Inc., Beverly MA) into pUC 18 vector (Amersham Pharmacia 
Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and 

3 o transfected into competent E. coli cells. Transformed cells are selected on antibiotic-containing media, 

individual colonies are picked and cultured overnight at 37°C in 384-well plates in LB/2x carbenicillin 
liquid media. 

The cells are lysed, and DNA is amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 
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94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2f min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA is quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries are reamplified 
using the same conditions as described above. Samples are diluted with 20% dimethysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (Applied Biosystems). \ 

In like manner, the ditbp is used to obtain regulatory sequences (promoters, infrons, and 
enhancers) using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

IX. Labeling of Probes and Southern Hybridization Analyses 

Hybridization probes derived from the dithp of the Sequence Listing are employed for screening 
cDNAs, mRNAs, or genomic DNA. The labeling of probe nucleotides between 100 and 1000 
nucleotides in length is specifically described, but, essentially the same procedure may be used with 
larger cDNA fragments. Probe sequences are labeled at room temperature for 30 minutes using a T4 
polynucleotide kinase, y^-ATP, and 0.5X One-Phor-AU Plus (Amersham Pharmacia Biotech) buffer 
and purified using a ProbeQuant G-50 Microcolumn (Amersham Pharmacia Biotech). The probe 
mixture is diluted tb 10 7 dpm/|ig/ml hybridization buffer and used in a typical membrane-based 
hybridization analysis. 

The DNA is digested with a restriction endonuclease such as Eco RV and is electrophoresed 
through a 0.7% agarose gel. The DNA fragments are transferred from the agarose to nylon membrane 
(NYTRAN Plus, Schleicher & Schuell, Inc., Keene NH) using procedures specified by the 
manufacturer of the membrane. Prehybridization is carried out for three or more hours at 68 °C, and 
hybridization is carried out overnight at 68 ° C. To remove non-specific signals, blots are sequentially 
washed at room temperature under increasingly stringent conditions, up to 0, Ix saline sodium citrate 
(SSC) and 0.5% sodium dodecyl sulfate. After the blots are placed in a PHOSPHORIMAGER cassette 
(Molecular Dynamics) or are exposed to autoradiography film, hybridization patterns of standard and 
experimental lanes are compared Essentially the same procedure is employed when screening RNA 

X. Chromosome Mapping of dithp 

The cDNA sequences which were used to assemble SEQ ID NO:l-21 1 are compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm Sequences from these databases that match SEQ 
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ID NO.i-21 1 are assembled into clusters of contiguous and overlapping sequences using assembly 
algorithms such as PHRAP (Table 7). Radiation hybrid and genetic mapping data available from 
public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
Research (WIGR), and G6n£thon are used to determine if any of the clustered sequences have been 
5 previously mapped. Inclusion of a mapped sequence in a cluster will result in the assignment of all 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. The genetic map 
locations of SEQ ID NO: 1-21 1 are described as ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 

1 o chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
are based on genetic markers mapped by G&£thon which provide boundaries for radiation hybrid 
markers whose sequences were included in each of the clusters. 

15 XI. Microarray Analysis 

Probe Preparation from Tissue or Cell Samples 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
polyA + RNA is purified using the oligo (dT) cellulose method. Each polyA + RNA sample is reverse 
transcribed using MMLV reverse-franscriptase, 0.05 pg/^1 oligo-dT primer (21mer), IX first strand 

2 o buffo-, 0.03 units/nl RNase inhibitor, 500 nM dATP, 500 \xM dGTP, 500 \xM dTTP, 40 jiM dCTP, 

40 yM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng polyA + RNA with GEMBRIGHT kits 
(Incyte). Specific control polyA* RNAs are synthesized by in vitro transcription from non-coding yeast 
genomic DNA (W. Lei, unpublished). As quantitative controls, the control mRNAs at 0.002 ng, 0.02 

25 ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction at ratios of 1 :100,000, 1 :10,000, 
1:1000, l:100(w/w) to sample mRNA respectively. The control mRNAs are diluted into reverse 
transcription reaction at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample mRNA differential 
expression patterns. After incubation at 37° C f or 2 hr, each reaction sample (one with Cy3 and another 
with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 20 minutes at 

30 85° C to the stop the reaction and degrade the RNA. Probes are purified using two successive 

CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. (CLONTECH), Palo 
Alto CA) and after combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 
mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The probe is then dried to completion 
using a SpeedVAC (Savant Instruments Inc., HoIbrookNY) and resuspended in 14 \i\ 5X SSC/0.2% 
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SDS. ' 

MiCTparrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
5 amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1 -2 ng to a final quantity greater than 5 jig. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
10 slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 
Products Corporation fv^R), West Chester, PA), washed extensively in distilled water, and coated 
with 0.05% aminqpropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
1 5 Patent No. 5 ,807,522, incorporated herein by reference. 1 fil of the array element DNA, at an average 
concentration of 100 ng/nl, is loaded into the open capillary printing dement by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
2 o Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 

buffered saline (PBS) (Tropix, Inc., Bedford, MA) for 30 minutes at 60° C followed by washes in 0.2% 
SDS and distilled water as before. 

Hybridization 

25 Hybridization reactions contain 9 jllI of probe mixture consisting of 0.2 \ig each of Cy3 and 

Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffo*. The probe mixture 
is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with an 1.8 
cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly larger 
than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 pi of 

30 5x SSC in a corner of the chamber. Hie chamber containing the arrays is incubated for about 6.5 

hours at 60° C. The arrays are washed far 10 min at 45° C in a first wash buffo: (IX SSC, 0.1% SDS), 
three times for 1 0 minutes each at 45° C in a second wash buffer (0. IX SSC), and dried. 

Detection 
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Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). Hie slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1 . 8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewatei NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fiuGrophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

Hie sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the probe mix at a known concentration. A specific location on the 
array contains a complementary DN A sequence, allowing the intensity of the signal at that location to 
be correlated with a weight ratio of hybridizing species of 1 : 1 00,000. When two probes from different 
sources (eg., representing test and control cells), each labeled with a different fluorophore, are 
hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 
calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood, MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed ova- the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated to 
obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
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XII. Complementary Nucleic Acids 

Sequences complementary to the dithp are used to detect, decrease* or inhibit expression of the 
naturally occurring nucleotide. The use of oligonucleotides comprising from about 15 to 30 base pairs 
is typical in the art -However, smaller or largo: sequence fragments can also be used Appropriate 
oligonucleotides are designed from the dithp using OLIGO 4.06 software (National Biosciences) or 
other appropriate programs and are synthesized using methods standard in the art or ordered from a 
commercial supplier. To inhibit transcription, a complementary oligonucleotide is designed from the 
most unique 5 9 sequence and used to prevent transcription factor binding to the promoter sequence. To 
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding and 
processing of the transcript 

XIII. Expression of DITHF 

Expression and purification of DITHP is accomplished using bacterial or virus-based 
expression systems. For expression of DITHP in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of 
cDNA transcriptioa Examples of such promoters include, but are not limited to, the trp4ac (tac) 
hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator 
regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., 
BL21 (DE3). Antibiotic resistant bacteria express DITHP upon induction with isopropyl beta-D- 
thiogalactopyranoside (EPTG). Expression of DITHP in eukaryotic cells is achieved by infecting insect 
or mammalian ceil lines with recombinant Autoeraphica califomica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding DITHP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See e.g., Engelhard, 
supra : and Sandig, supra .) 

In most expression systems, DITHP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum. enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from DITHP at 
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specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoafifinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak 
Company, Rochester NY). 6-His, a stretch of six consecutive histidine residues, enables purification on 
metal-cheiate resins (QIAGEN). Methods for protein egression and purification are discussed in 
5 Ausubei (1995, supra , Chapters 10 and 16). Purified DITHP obtained by these methods can be used 
directly in the following activity assay. 

XIV. Demonstration of DITHP Activity 

DITHP activity is demonstrated through a variety of specific assays, some of which are 
10 outlined below. 

Oxidoreductase activity of DITHP is measured by the increase in extinction coefficient of 
NAD(P)H coenzyme at 340 nmior the measurement of oxidation activity, or the decrease in extinction 
coefficient of NAD(P)H coenzyme at 340 nmfor the measurement of reduction activity (Dalziel, K. 
(1963) J. Biol. Chem. 238:2850-2858). One of three substrates may be used: Asn-PGal, biocytidine, or 
15 ubiquinone^lO. The respective subunits of the enzyme reaction, for example, cytocbtame c r b 
oxidoreductase and cytochrome c, are reconstituted. The reaction mixture contains a) 1-2 mg/ml 
DITHP; and b) 15 mM substrate, 2.4 mM NAD(P) + in 0.1 M phosphate buffer, pH 7.1 (oxidation 
reaction), or 2.0 mM NAD(P)H, in 0. 1 M Na^HPC^ buffer, pH 7.4 ( reduction reaction); in a total 
volume of 0. 1 ml. Changes in absorbance at 340 nm (A^ are measured at 23.5 ° C using a recording 

2 o spectrophotometer (Shimadzu Scientific Instruments, Inc., Pleasanton CA). Hie amount of NAD(P)H 

is stoichiometrically equivalent to the amount of substrate initially present, and the change in A^ is a 
direct measure of the amount of NAD(P)H produced; AA^ ~ 6620 [NADH]. Oxidoreductase activity 
of DITHP activity is proportional to the amount of NAD(P)H present in the assay. 

Transferase activity of DITHP is measured through assays such as a methyl transferase assay 
25 in which the transfer of radiolabeled methyl groups between a donor substrate and an acceptor 

substrate is measured (Bokar, J. A et al. (1994) J. Biol. Chem. 269: 17697-17704), Reaction mixtures 
(50 pi final volume} contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 3% 
polyvinylalcohol, 1.5 \iCi [mfitfiyZ- 3 H]AdoMet (0.375 pM AdoMet) (DuPont-NEN), 0.6 *ig DITHP, 
and acceptor substrate (0.4 \xg [ 33 S]RNA or 6-mercaptopurine (6-MP) to 1 mM final concentration). 

3 o Reaction mixtures are incubated at 30°C for 30 minutes, then 65 °C for 5 minutes. Hie products are 

separated by chromatography or electrophoresis and the level of methyl transferase activity is 
determined by quantification of methyl-*R recovery. 

DITHP hydrolase activity is measured by the hydrolysis of appropriate synthetic peptide 
substrates conjugated with various chromogenic molecules in which the degree of hydrolysis is 
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quantified by spectrophotometric (or fluorometric) absorption of the released chromophore. (Beynon, 
RJ. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach , Oxford University Press, New 
York NY, pp. 25-55) Peptide substrates are designed according to the category of protease activity as 
endopeptidase (serine, cysteine, aspartic proteases), animqpeptidase (leucine aminopeptidase), or 
carboxypeptidase (Carboxypeptidase A and B, procollagen C-protdnase). 

DITHP isomerase activity such as peptidyl prolyl cis/trans isomerase activity can be assayed 
by an enzyme assay described by Rahfeld, J.U., et al. (1994) (FEBS Lett 352: 180-184). The assay 
is performed at 10°C in 35 mM HEPES buffer, pH 7.8, containing chymotrypsin (0.5 mg/ml) and 
DITHP at a variety of concentrations. Under these assay conditions, the substrate, Suc-Ala-Xaa-Pro- 
Phe-4-NA, is in equilibrium with respect to the prolyl bond, with 80-95% in trans and 5-20% in cis 
conformation. An aliquot (2 ul) of the substrate dissolved in dimethyl sulfoxide (10 mg/ml) is added 
to the reaction mixture described above. Only the cis isomer of the substrate is a substrate for 
cleavage by chymotrypsin. Thus, as the substrate is isomerized by DITHP, the product is cleaved by 
chymotrypsin to produce 4-nitroanilide, which is detected by it's absorbance at 390 nm. 4- 
Nitroanilide appears in a time-dependent and a DITHP concentration-dependent manna:. 

An assay for DITHP activity associated with growth and development measures cell 
proliferation as the amount of newly initiated DNA synthesis in Swiss mouse 3T3 cells. A plasmid 
containing polynucleotides encoding DITHP is transfected into quiescent 3T3 cultured cells using 
methods well known in the art. The transiently transfected cells are then incubated in the presence of 
[ 3 H]thymidine, a radioactive DNA precursor. Where applicable, varying amounts of DITHP ligand are 
added to the transfected cells. Incorporation of ^thymidine into acid-precipitable DNA is measured 
over an appropriate time interval, and the amount incorporated is directly proportional to the amount of 
newly synthesized DNA. 

Growth factor activity of DITHP is measured by the stimulation of DNA synthesis in Swiss 
mouse 3T3 ceils (McKay, I. and I. Leigh, eds. (1993) Growth Factors: A Practical Approach , Oxford 
University Press, New York NY). Initiation of DNA synthesis indicates the cells' entry into the mitotic 
cycle and their commitment to undergo later division 3T3 cells are competent to respond to most 
growth factors, not only those that are mitogenic, but also those that are involved in embryonic 
induction. This competence is possible because the in vivo specificity demonstrated by some growth 
factors is not necessarily inherent but is determined by the responding tissue. In this assay, varying 
amounts of DITHP are added to quiescent 3T3 cultured cells in the presence of pHJthymidine, a 
radioactive DNA precursor. DITHP for this assay can be obtained by recombinant means or from 
biochemical preparations. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured over 
an appropriate time interval, and the amount incorporated is directly proportional to the amount of 
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newly synthesized DNA A linear dose-response curve over at least a hundred-fold DITHP 
concentration range is indicative of growth factor activity. One unit of activity per milliliter is defined 
as the concentration of DITHP producing a 50% response level, where 100% represents maximal 
incorporation of [ 3 H]thymidine into acid-precipitable DNA. 
5 Alternatively, an assay for cytokine activity of DITHP measures the proliferation of 

leukocytes. In this assay, the amount of tritiated thymidine incorporated into newly synthesized DNA 
is used to estimate proliferative activity. Varying amounts of DITHP are added to cultured 
leukocytes, such as granulocytes, monocytes, or lymphocytes, in the presence of [ 3 H]thymidine, a 
radioactive DNA precursor. DITHP for this assay can be obtained by recombinant means or from 

10 biochemical preparations. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured 
over an appropriate time interval, and the amount incoiporated is directly proportional to the amount 
of newly synthesized DNA. A linear dose-response curve over at least a hundred-fold DITHP 
concentration range is indicative of DITHP activity. One unit of activity per milliliter is 
conventionally defined as the concentration of DITHP producing a 50% response level, where 100% 

15 represents maximal incorporation of [ 3 H]thymidine into acid-precipitable DNA 

An alternative assay for DITHP cytokine activity utilizes a Boyden micro chamber 
(Neuroprobe, Cabin John MD) to measure leukocyte chemotaxis (Vicari, supra) . In this assay, about 
10 5 migratory cells such as macrophages or monocytes are placed in cell culture media in the upper 
compartment of the chamber. Varying dilutions of DITHP are placed in the lower compartment The 

2 o two compartments are separated by a 5 or ? micron pore polycarbonate filter (Nucleopore, Pleasanton 
CA). After incubation at 37 °C for 80 to 120 minutes, the filters are fixed in methanol and stained 
with appropriate labeling agents. Cells which migrate to the other side of the filter are counted using 
standard microscopy. The chemotactic index is calculated by dividing the number of migratory cells 
counted when DITHP is present in the lower compartment by the number of migratory cells counted 

2 5 when only media is present in the lower compartment The chemotactic index is proportional to the 
activity of DITHP. 

Alternatively, cell lines or tissues transformed with a vector containing dithp can be assayed for 
DITHP activity by immunoblotting. Cells are denatured in SDS in the presence of p-mercaptoethanol, 
nucleic acids removed by ethanol precipitation, and proteins purified by acetone precipitatioa Pellets 
30 are resuspended in 20 mM tris buffer at pH 7.5 and incubated with Protein G-Sepharose pre-coated 
with an antibody specific for DITHP. After washing, the Sepharose beads are boiled in electrophoresis 
sample buffer, and the duted proteins subjected to SDS-PAGE. The SDS-PAGE is transferred to a 
nitrocellulose membrane for immunoblotting, and the DITHP activity is assessed by visualizing and 
quantifying bands on the blot using the antibody specific for DITHP as the primary antibody and 125 I- 
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labeled IgG specific for the primary antibody as the secondary antibody. 

DITHP kinase activity is measured by phosphorylation of a protein substrate using y-labeled 
[ 32 P]-ATP and quantitation of the incorporated radioactivity using a radioisotope counter. DITHP is 
incubated with the protein substrate, f^PJ-ATP, and an appropriate kinase buffer. The [ 32 P] 
5 incorporated into the product is separated from free [ 32 P]-ATP by electrophoresis and the incorporated 
pP] is counted. The amount of recovered is proportional to the kinase activity of DITHP in 
the assay. A determination of the specific amino acid residue phosphorylated is made by 
phosphoamino acid analysis of the hydrolyzed protein. 

In the alternative, DITHP activity is measured by the increase in cell proliferation resulting 

10 from transformation of a mammalian cell line such as COS7, HeLa or CHO with an eukaryotic 
expression vector encoding DITHP. Eukaryotic expression vectors are commercially available, and 
the techniques to introduce them into cells are well known to those skilled in the art The cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression of DITHP. Phase microscopy is then used to compare the mitotic index of transformed 

15 versus control cells. An increase in the mitotic index indicates DITHP activity. 

In a further alternative, an assay for DITHP signaling activity is based upon the ability of 
GPCR family proteins to modulate G protein-activated second messenger signal transduction 
pathways (e.g., cAMP; Gaudin, P. et al. (1998) J. Biol. Chenx 273:4990-4996). Aplasmid encoding 
full length DITHP is transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or 

20 human embryonic kidney (HEK-293) cell lines) using methods well-known in the art Transfected 
cells are grown in 12- well trays in culture medium for 48 hours, then the culture medium is discarded, 
and the attached cells are gently washed with PBS. The cells are then incubated in culture medium 
with or without ligand for 30 minutes, then the medium is removed and cells lysed by treatment with 
1 M perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using 

2 5 methods well-known in the art. Changes in the levels of c AMP in the lysate from cells exposed to 
ligand compared to those without ligand are proportional to the amount of DITHP present in the 
transfected cells. 

Alternatively, an assay for DITHP protein phosphatase activity measures the hydrolysis of P- 
nitrophenyl phosphate (PNPP). DITHP is incubated together with PNPP in HEPES buffer pH 7.5, in 
30 the presence of 0. 1 % (J-mercaptoethanol at 37 ° C for 60 min. The reaction is stopped by the addition of 
6 ml of 10 N NaOH, and the increase in light absorbance of the reaction mixture at 410 nm resulting 
from the hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance 
is proportional to the phosphatase activity of DITHP in the assay (Diamond, R.H. et al (1994) Mol Cell 
Biol 14:3752-3762). 
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An alternative assay measures DITHP-mediated G-protein signaling activity by monitoring 
the mobilization of Ca** as an indicator of the signal transduction pathway stimulation (See, e.g., 
Grynkievicz, G. et al. (1985) J. Biol. Chem. 260:3440; McCoIl, S. et al. (1993) J. Immunol. 
150:4550-4555; and Aussel, C. et al. (1988) J. Immunol. 140:215-220). The assay requires 
5 preloading neutrophils or T cells with a fluorescent dye such as FURA-2 or BCECF (Universal 
Imaging Corp, Westchester PA) whose emission characteristics are altered by Ca* binding. When 
the cells are exposed to one or more activating stimuli artificially (e.g., anti-CD3 antibody ligation of 
the T cell receptor) or physiologically (e.g., by allogeneic stimulation), Ca 4 * flux takes place. This 
flux can be observed and quantified by assaying the cells in a fluorometer or fluorescent activated cell 

10 sorter. Measurements of Ca** flux are compared between cells in their normal state and those 

transfected with DITHP. Increased Ca"" mobilization attributable to increased DITHP concentration 
is proportional to DITHP activity, 

DITHP transport activity is assayed by measuring uptake of labeled substrates into Xenopus 
laevis oocytes. Oocytes at stages V and VI are injected with DITHP mRNA (10 ng pa* oocyte) and 

15 incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 2 , ImM MgCl 2 , 
ImM NaJHDPC^, 5 mM Hepes, 3.8 mM NaOH, 50\ig/nH gentamycin, pH 7.8) to allow expression of 
DITHP protein Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM KC1, 
ImM CaCl* ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino 
acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 

2 o radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 
incorporated label, and comparing with controls. DITHP transport activity is proportional to the level 
of internalized labeled substrata 

DITHP transferase activity is demonstrated by a test for galactosyltransferase activity. Hiis 

25 can be determined by measuring the transfer of radiolabeled galactose from UDP-galactose to a 

GlcNAc-tenninated oligosaccharide chain (Kolbinger, F. et al. (1998) J. Biol Chem. 273:58-65). The 
sample is incubated with 14 \il of assay stock solution (180 mM sodium cacodylate, pH 6.5, 1 mg/ml 
bovine serum albumin, 0.26 mM UDP-galactose, 2 jil of UDP-pH]galactose), 1 nl of MnCl* (500 
mM), and 2.5 ^1 of GlcNAcpO-(CH*) 8 -C0 2 Me (37 mg/ml in dimethyl sulfoxide) for 60 minutes at 

30 37 °C. The reaction is quenched by the addition of 1 ml of water and loaded on a CI 8 Sep-Pak 
cartridge (Waters), and the column is washed twice with 5 ml of water to remove unreacted UDP- 
PHJgalactose The pH]galactosylated GlcNAcpO^CH^-COjMe remains bound to the column during 
the water washes and is eluted with 5 ml of methanol. Radioactivity in the eluted material is measured 
by liquid scintillation counting and is proportional to galactosyltransferase activity in the starting 
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sample. 

In the alternative, DITHP induction by heat or toxins may be demonstrated using primary 
cultures of human fibroblasts or human cell lines such as CCL-13, HEK293, or HEP G2 (ATCC). To 
heat induce DITHP expression, aliquots of cells are incubated at 42 °C for 15, 30, or 60 minutes. 
5 Control aliquots are incubated at 37 °C for the same time periods. To induce DITHP expression by 
toxins, aliquots of cells are treated with 100 \M arsenite or 20 mM azetidine-2-carboxylic acid for 0, 
3, 6, or 12 hours. After exposure to heat, arsenite, or the amino acid analogue, samples of the treated 
cells are harvested and cell lysates prepared for analysis by western blot Cells are ly sed in lysis 
buffer containing 1% Nonidet P-40, 0.15 M NaCl, 50 mM Tris-HQ, 5 mM EDTA, 2 mM 

10 N-ethylmaleimide, 2 mM phenylmethylsulfonyl fluoride, 1 mg/ml leupeptin, and 1 mg/ml pepstatin. 
Twenty micrograms of the cell lysate is separated on an 8% SDS-PAGE gel and transferred to a 
membrane. After blocking with 5% nonfat dry milk.'phosphate-buffered saline for 1 h, the membrane 
is incubated overnight at 4°C or at room temperature for 2-4 hours with a 1 : 1000 dilution of 
anti-DITHP serum in 2% nonfat dry milk/phosphate-buffered saline. The membrane is then washed 

is and incubated with a 1:1000 dilution of horseradish peroxidase-conjugated goat anti-rabbit IgG in 2% 
dry milk/phosphate-buffered saline. After washing with 0. 1 % Tween 20 in phosphate-buffered saline, • 
the DITHP protein is detected and compared to controls using chemiluminescence. 

Alternatively, DITHP protease activity is measured by the hydrolysis of appropriate synthetic 
peptide substrates conjugated with various chromogenic molecules in which the degree of hydrolysis 

20 is quantified by spectrophotometric (or fluorometric) absorption of the released chromophore 

(Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University 
. Press, New York, NY, pp.25-55). Peptide substrates are designed according to the category of 
protease activity as endopeptidase (serine, cysteine, aspartic proteases, or metalloproteases), 
aminopeptidase (leucine aminopeptidase), or carboxypeptidase (carboxypeptidases A and B, 

25 procollagen C-proteinase). Commonly used chromogens are 2-naphthylamine, 4-nitroanUine, and 
furylacrylic acid. Assays are performed at ambient temperature and contain an aliquot of the enzyme 
and the appropriate substrate in a suitable buffer. Reactions are carried out in an optical cuvette, and 
the increase/decrease in absorbance of the chromogen released during hydrolysis of the peptide 
substrate is measured. The change in absorbance is proportional to the DITHP protease activity in the 

30 assay. 

In the alternative, an assay for DITHP protease activity takes advantage of fluorescence 
resonance energy transfer (FRET) that occurs when one donor and one acceptor fluorophore with an 
appropriate spectral overlap are in close proximity. A flexible peptide linker containing a cleavage 
site specific for PRTS is fused between a red-shifted variant (RSGFP4) and a blue variant (BFP5) of 
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Green Fluorescent Protein. This fusion protein has spectral properties that suggest energy transfer is 
occurring from BFP5 to RSGFP4. When the fusion protein is incubated with DITHP, the substrate is 
cleaved, and the two fluorescent proteins dissociate. This is accompanied by a marked decrease in 
energy transfer which is quantified by comparing the emission spectra before and after the addition of 
5 DITHP (Mitra, R.D. et al (1996) Gene 173:13-17). This assay can also be performed in living cells. 
In this case the fluorescent substrate protein is expressed constitutively in cells and DITHP is 
introduced on an inducible vector so that FRET can be monitored in the presence and absence of 
DITHP (Sagot, L et al (1999) FEBS Lett. 447:53-57). 

A method to determine the nucleic acid binding activity of DITHP involves a poly acrylamide 

l o gel mobility-shift assay. In preparation for this assay, DITHP is expressed by transforming a 
mammalian cell line such as COS7, HeLa or CHO with a eukaryotic expression vector containing 
DITHP cDNA, The- cells are incubated for 48-72 hours after transformation under conditions 
appropriate for the cell line to allow expression and accumulation of DITHP. Extracts containing 
solubilized proteins can be prepared from cells expressing DITHP by methods well known in the art 

15 Portions of the extract containing DITHP are added to [ 32 P]-labeled RNA or DNA Radioactive 
nucleic acid can be synthesized in vitro by techniques well known in the art The mixtures are 1 
incubated at 25 °C in the presence of RNase- and DNase-inhibitors under buffered conditions for 5-10 
minutes. After incubation, the samples are analyzed by polyacrylamide gel electrophoresis followed 
by autoradiography. Hie presence of a band on the autoradiogram indicates the formation of a 

20 complex between DITHP and the radioactive transcript. A band of similar mobility will not be 
present in samples prepared using control extracts prepared from untransformed cells. 

In the alternative, a method to determine the methylase activity of a DITHP measures transfer 
of radiolabeled methyl groups between a donor substrate and an acceptor substrate. Reaction 
mixtures (50 [A final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 

25 3% polyvinylalcohol, 1.5 \id [methyl- 3 K\A&6M£t (0.375 \M AdoMet) (DuPont-NEN), 0.6 \xg 
DITHP, and acceptor substrate (e.g., 0.4 jog [ 35 S]RNA, or 6-mercaptopurine (6-MP) to 1 mM final 
concentration). Reaction mixtures are incubated at 30°C for 30 minutes, then 65 °C for 5 minutes. 
Analysis of [w*tfiyf- 3 H]RNA is as follows: 1) 50 |ul of 2 x loading buffer (20 mM Tris-HQ, pH 7.6, 1 
M LiCl, 1 mM EDTA, 1 % sodium dodecyl sulphate (SDS)) and 50 pi oligo d(T)-cellulose (10 mg/ml 

30 in 1 x loading buffer) are added to the reaction mixture, and incubated at ambient temperature with 
shaking for 30 minutes. 2) Reaction mixtures are transferred to a 96- well filtration plate attached to a 
vacuum apparatus. 3) Each sample is washed sequentially with three 2.4 ml aliquots of 1 x oligo d(T) 
loading buffer containing 0.5% SDS, 0.1% SDS, or no SDS. and 4) RNA is eluted with 300 pi of 
water into a 96- well collection plate, transferred to scintillation vials containing liquid scintillant, and 
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radioactivity determined. Analysis of [m£tfiyi- 3 H]6-MP is as follows: 1) 500 pi 0.5 M borate buffer, 
pH 10.0, and then 2,5 ml of 20% (v/v) isoamyl alcohol in toluene are added to the reaction mixtures. 
2) The samples mixed by vigorous vortexing for ten seconds. 3) After centrifugation at 700g for 10 
minutes, 1.5 ml of the organic phase is transferred to scintillation vials containing 0.5 ml absolute 
5 ethanol and liquid scintillant, and radioactivity determined, and 4) Results are corrected for the 
extraction of 6-MP into the organic phase (approximately 41%). 

An assay for adhesion activity of DITHP measures the disruption of cytoskeletal filament 
networks upon overexpression of DITHP in cultured cell lines (Rezniczek, G.A. et al. (1998) J. Cell 
Biol. 141 :209-225). cDNA encoding DITHP is subcloned into a mammalian expression vector that 

10 drives high levels of cDNA expression. This construct is transfected into cultured cells, such as rat 
kangaroo PtK2 or rat bladder carcinoma 804G cells. Acta filaments and intermediate filaments such 
as keratin and vimentin are visualized by immunofluorescence microscopy using antibodies and 
techniques well known in the art The configuration and abundance of cytoskeletal filaments can be 
assessed and quantified using confocal imaging techniques. In particular, the bundling and collapse 

is of cytoskeletal filament networks is indicative of DITHP adhesion activity. 

Alternatively, an assay for DITHP activity measures the expression of DITHP on the cell 
surface. cDNA encoding DITHP is transfected into a non-leukocytic cell line. Cell surf ace proteins 
are labeled with biotin (de la Fuente, MA et al. (1997) Blood 90:2398-2405). Immunoprecipitations 
are performed using DITHP-specific antibodies, and immunoprecipitated samples are analyzed using 

2 o SDS-PAGE and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 
immunoprecipitant is proportional to the amount of DITHP expressed on the cell surface. 

Alternatively, an assay for DITHP activity measures the amount of cell aggregation induced 
by overexpression of DITHP. In this assay, cultured cells such as NIH3T3 are transfected with cDNA 
encoding DITHP contained within a suitable mammalian expression vector under control of a strong 

2 5 promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green 

Fluorescent Protein (CLONTECH), is useful for identifying stable transfectants. The amount of cell 
agglutination, or clumping, associated with transfected cells is compared with that associated with 
untransfected cells. The amount of cell agglutination is a direct measure of DITHP activity. 

DITHP may recognize and precipitate antigen from serum. This activity can be measured by 

30 the quantitative precipitin reaction (Golub, E.S. et al. (1987) Immunology: A Synthesis. Sinauer 
Associates, Sunderland MA, pages 113-115). DITHP is isotopically labeled using methods known in 
the art. Various serum concentrations are added to constant amounts of labeled DITHP. DITHP- 
antigen complexes precipitate out of solution and are collected by centrifugation. The amount of 
precipitable DITHP-anligen complex is proportional to the amount of radioisotope detected in the 
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precipitate. The amount of precipitable DrTHP-antigen complex is plotted against the serum 
concentration. For various serum concentrations, acharacteristic precipitation curve is obtained, in 
which the amount of precipitable DrTHP-antigen complex initially increases proportionately with 
increasing serum concentration, peaks at the equivalence point, and then decreases proportionately with 
5 further increases in serum concentrator Thus, the amount of precipitable DITHP-antigen complex is 
a measure of DITHP activity which is characterized by sensitivity to both touting and excess quantities 
of antigen. 

A microtubule motility assay for DITHP measures motor protein activity. In this assay, 
recombinant DITHP is immobilized onto a glass slide or similar substrate. Taxol-stabilized bovine 
1 o brain microtubules (commercially available) in a solution containing ATP and cytosolic extract are 
perfused onto the slide. Movement of microtubules as driven by DITHP motor activity can be 
visualized and quantified using video-enhanced light microscopy and image analysis techniques. 
DITHP motor protein activity is directly proportional to the frequency and velocity of microtubule 
movement. 

l s Alternatively, an assay for DITHP measures the formation of protein filaments in vitro . A 

solution of DITHP at a concentration greater than the "critical concentration" for polymer assembly is 
applied to carbon-coated grids. Appropriate nucleation sites may be supplied in the solution. The 
grids are negative stained with 0.7% (w/v) aqueous uranyl acetate and examined by electron 
microscopy. The appearance of filaments of approximately 25 nm (microtubules), 8 nm (actin), or 10 

20 nm (intermediate filaments) is a demonstration of protein activity. 

DITHP electron transfer activity is demonstrated by oxidation or reduction of NADP. 
Substrates such as Asn-PGal, biocytidine, or ubiquinone-10 may be used. The reaction mixture 
contains 1-2 mg/ml HORP, 15 mM substrate, and 2.4 mM NAD(P) + in 0.1 M phosphate buffo*, pH 
7.1 (oxidation reaction), or 2.0 mM NAD(P)H, in 0.1 M NajHPC^ buffer, pH 7.4 (reduction reaction); 

25 in a total volume of 0.1 ml. FAD may be included with NAD, according to methods well known in the 
art. Changes in absorbance are measured using a recording spectrophotometer. The amount of 
NAD(P)H is stoichiometrically equivalent to the amount of substrate initially present, and the change in 
A^ is a direct measure of the amount of NAD(P)H produced; AA^ = 6620[NADH]. DITHP activity 
is proportional to the amount of NAD(P)H present in the assay. The increase in extinction coefficient 

30 of NAD(P)H coenzyme at 340 nm is a measure of oxidation activity, or the decrease in extinction 
coefficient of NAD(P)H coenzyme at 340 nm is a measure of reduction activity (Dalziel, K. (1963) J. 
Biol. Chem. 238:2850-2858). 

DITHP transcription factor activity is measured by its ability to stimulate transcription of a 
reporter gene (Liu, H.Y. et al. (1997) EMBO J. 16:5289-5298). The assay entails the use of a well 
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characterized reporter gene construct, LexA^-LacZ, that consists of LexA DN A transcriptional control 
elements (LexAop) fused to sequences encoding the E. coli LacZ enzyme. Hie methods for constructing 
and expressing fusion genes, introducing them into cells, and measuring LacZ enzyme activity, are well 
known to those skilled in the art Sequences encoding DTTHP are cloned into a plasmid that directs the 
5 synthesis of a fusion protein, LexA-DITHP, consisting of DITHP and a DNA binding domain derived 
from the LexA transcription factor. The resulting plasmid, encoding a LexA-DITHP fusion protein, is 
introduced into yeast cells along with a plasmid containing the LexA^-LacZ reporter gene. The amount 
of LacZ enzyme activity associated with LexA-DITHP transacted cells, relative to control cells, is 
proportional to the amount of transcription stimulated by the DITHP. 

1 o Chromatin activity of DITHP is demonstrated by measuring sensitivity to DNase I (Dawson, 

B.A et aL (1989) J. Biol. Chem. 264:12830-12837). Samples are treated with DNase I, followed by 
insertion of a cleavable biotinylated iiucleoiide analog, 5-[(N-biounainido)hexanoamid 
thiopropionyl-S-aminoaUyl]^' -deoxyuridine 5 ' -triphosphate using nick-repair techniques well known 
to those skilled in the art Following purification and digestion with EcoRI restriction endonuclease, 
1 5 biotinylated sequences are affinity isolated by sequential binding to streptavidin and biotincellulose. 
Another specific assay demonstrates the ion conductance capacity of DITHP using an 
electrophysiological assay. DITHP is expressed by transforming a mammalian cell line such as 
COS7, HeLa or CHO with a eukaryotic expression vector encoding DITHP. Eukaryotic expression 
vectors are commercially available, and the techniques to introduce them into cells are well known to 

2 o those skilled in the art. A small amount of a second plasmid, which expresses any one of a number of 

marker genes such as P-galactosidase, is co-transformed into the cells in order to allow rapid 
identification of those cells which have taken up and expressed the foreign DNA. Hie cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression and accumulation of DITHP and p-galactosidase. Transformed cells expressing p- 

2 5 galactosidase are stained blue when a suitable colorimetric substrate is added to the culture media 

under conditions that are well known in the art Stained cells are tested for differences in membrane 
conductance due to various ions by electrophysiological techniques that are well known in the art. 
Untransformed cells, and/or cells transformed with either vector sequences alone or p-galactosidase 
sequences alone, are used as controls and tested in parallel. The contribution of DITHP to cation or 

3 o anion conductance can be shown by incubating the cells using antibodies specific for either DITHP. 

Trie respective antibodies will bind to the extracellular side of DITHP, thereby blocking the pore in 
the ion channel, and the associated conductance. 

XV. Functional Assays 
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DITHP function is assessed by expressing dithp at physiologically elevated levds in 
mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a 
strong promote: that drives high levels of cDNA expression Vectors of choice include pCMV SPORT 
(Life Technologies) and pCR3. 1 (Invitrogen Corporation Carlsbad CA), both of which contain the 
5 cytomegalovirus promoter. 5-10 ng of recombinant vector are transiently transfected into a human cell 
line, preferably of endothelial or hematopoietic origin, using either liposome formulations or 
electroporation 1-2 pig of an additional plasmid containing sequences encoding a marker protein are 
co-transfected 

Expression of a marker protean provides a means to distinguish transfected cells from 

1 o nontr ansfected cells and is a reliable predictor of cDNA expression from the recombinant vector. 

Marker proteins of choice include, e,g„ Green Fluorescent Protein (GFP; CLONTECH), CD64, or a 
CD64-GFP fusion protean. Row cytometry (FCM), an automated laser oplics-based technique, is used 
to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apqptotic state of the 
cells and other cellular properties. 
1 5 FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding 

or coincident with cell death. These events include changes in nuclear DNA content as measured by 
staining of DNA with prqpidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 

2 o proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 

composition as measured by the binding of fhiorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry . Oxford, 
New York NY. 

The influence of DITHP on gene expression can be assessed using highly purified populations 
25 of cells transfected with sequences encoding DITHP and either CD64 or CD64-GFP. CD64and 
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
inununoglobulin G (IgG). Transfected cells are efficiently separated from nontr ansfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Inc., Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art 

3 o Expression of mRNA encoding DITHP and other genes of interest can be analyzed by northern analysis 

or microarray techniques, 

XVI, Production of Antibodies 

DITHP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
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Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or otto purification techniques, is used to 

immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the DITHP amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding peptide is synthesized 
5 and used to raise antibodies by means known to those of skill in the art Methods for selection of 

appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in 

the art. (See, e.g., Ausubd, 1995, supra . Chapter 11.) 

Typically, peptides 15 residues in length are synthesized using an ABI 431 A peptide 

synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction with 
10 N-maldmidobmzoyl-N-hydroxysuccinhriide ester (MBS) to increase immunogenicity. (See, e.g., 

Ansubel, supra .) Rabbits are immunized with the peptide-KLH complex in complete Freund's 

adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to 

plastic, blocking with 1 % BS A, reacting with rabbit antisera, washing, and reacting with radio- 

iodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-DITHP activity 
1 5 using protocols well known in the art, including ELIS A, RIA, and immunoblotting. 

XVII. Purification of Naturally Occurring DITHP Using Specific Antibodies 
Naturally occurring or recombinant DITHP is substantially purified by immunoaffinity 

chromatography using antibodies specific for DITHP. An immunoaffinity column is constructed by 
2 o covalently coupling anti-DITHP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing DITHP are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of DITHP (e.g., high ionic strength 

2 5 buffers in the presence of detergent). The column is eluted undo* conditions that disrupt 

anfibody/DITHP binding (e.g., a buffo* of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and DITHP is collected. 

XVIII. Identification of Molecules Which Interact with DITHP 

3 o DITHP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent 

(See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled DITHP, washed, and 
any wells with labeled DITHP complex are assayed. Data obtained using different concentrations of 
DITHP are used to calculate values for the number, affinity, and association of DITHP with the 
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candidate molecules. 

Alternatively, molecules interacting with DITHP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and 0. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (CLONTECH). 

DITHP may also be used in the PATHCALLDSfG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-tbrougbput manner to determine all interactions 
between the protons encoded by two large libraries of genes (Nandabalan, K et al. (2000) U.S. Patent 
No. 6,057,101). 

All publications and patents mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described method and system of the invention 
will be apparent to those skilled in the art without departing uuw the scope and spirit of the inventioa 
Although the invention has been described in connection with specific preferred embodiments, it should 
be understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or related fields are intended to be within the 
scope of the following claims. 
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TABLE 3 





Template ID 


Start 


Stop 


Frame 


Domain 


Topology 


SEQ ID NO: 










Type 




1 


LG:1040582.1 :2000FEB18 


31 


117 


forward 1 


TM 


Nin 


1 


LG:1040582.1 :2000FEB18 


319 


405 


forward 1 


TM 


Nin 


1 


LG:1040582.1 :2000FEB18 


108 


155 


forward 3 


TM 


N out 


2 


LG:453570.1:2000FEB18 


361 


447 


forward 1 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1318 


1404 


forward 1 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1025 


1099 


forward 2 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1298 


1360 


forward 2 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1379 


1441 


forward 2 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1463 


1537 


forward 2 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1047 


1133 


forward 3 


TM 


Nin 


3 


LG:408751.3:2000FEB18 


1266 


1352 


forward 3 


TM 


Nin 


3 


LG:408751 .3:2000FEB1 8 


1419 


1469 


forward 3 


TM 


Nin 


4 


LI:090574.1:2000FEB01 


79 


144 


forward 1 


TM 


Nin 


4 


LI:090574.1:2000FEB01 


607 


678 


forward 1 


TM 


Nin 


4 


LI:090574.1:2000FEB01 


1009 


1080 


forward 1 


TM 


Nin 


4 


LI:090574.1:2000FEB01 


497 


583 


forward 2 


TM 


N out 


4 


LI:O90574.1:20OOFEBO1 


743 


829 


forward 2 


TM 


Nout 


4 


LI:090574.1:2000FEB01 


1026 


1085 


forward 3 


TM 


Nout 


5 


LI:229932.2:2000FEB01 


76 


162 


forward 1 


TM 


Nout 


5 


LI:229932.2:2000FEB01 


190 


276 


forward 1 


TM 


Nout 


5 


LI:229932.2:2000FEB01 


1237 


1323 


forward 1 


TM 


Nout 


5 


LI:229932.2:2000FEB01 


68 


142 


forward 2 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


335 


412 


forward 2 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


758 


844 


forward 2 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


1229 


1288 


forward 2 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


60 


146 


forward 3 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


216 


302 


forward 3 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


690 


752 


forward 3 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


765 


827 


forward 3 


TM 


Nin 


5 


LI:229932.2:2000FEB01 


1209 


1289 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


343 


399 


forward 1 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


1078 


1131 


forward 1 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


1606 


1692 


forward 1 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


2218 


2274 


forward 1 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


2383 


2433 


forward 1 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


110 


196 


forward 2 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


1307 


1378 


forward 2 


TM 


Nin 


6 


11:3321 76.1 :2000FEB01 


1640 


1726 


forward 2 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


1946 


2005 


forward 2 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


135 


200 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


693 


752 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


777 


839 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


867 


929 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


1035 


1118 


forward 3 


TM 


Nin 


b 


Ll.oo^l ft). l.ZUUUrcDUl 


1 1 /O 


1 2bo 


forward 3 


1 M 


n in 


6 


LI:332176.1:2000FEB01 


1572 


1658 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


2121 


2180 


forward 3 


TM 


Nin 


6 


1.1:3321 76.1 :2000FEB01 


2277 


2363 


forward 3 


TM 


Nin 


6 


LI:332176.1:2000FEB01 


2400 


2456 


forward 3 


TM 


Nin 


8 


LG:220992.1 :2000MAY1 9 


343 


393 


forward 1 


TM 




8 


LG-.220992.1 2000MAY1 9 


646 


732 


forward 1 


TM 




8 


LG:220992.1 :2000MAY19 


1639 


1725 


forward 1 


TM 




8 


LG:220992.1 :2000MAY19 


1879 


1965 


forward 1 


TM 






r 


194 
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8 


LG.220992.1 .2000MAY1 9 


2005 


2088 


forward 1 


TM 




8 


LG:220992.1 :2000MAY1 9 


17 


76 


forward 2 


TM 


Nin 


8 


LG:220992.1:2000MAY19 


1646 


1732 


forward 2 


TM 


Nin 


8 


1X3:220992.1 2000MAY1 9 


1850 


1933 


forward 2 


TM 


Nin 


8 


LG:220992.1:2000MAY19 


1434 


1484 


forward 3 


TM 


Nout 


8 


LG.-220992.1 2000MAY1 9 


1734 


1820 


forward 3 


TM 


Nout 


8 


LG:220992.1:2000MAY19 


1974 


2036 


forward 3 


TM 


Nout 


8 


LG.220992.1 :2000MAY1 9 


2067 


2129 


forward 3 


TM 


Nout 


8 


LG:220992.1 2000MAY19 


2151 


2237 


forward 3 


TM 


Nout 


9 


LG:1 094571 .1 :2000MAY1 9 


781 


867 


forward 1 


TM 


Nin 


9 


LG:1094571.1:2000MAY19 


419 


505 


forward 2 


TM 


Nin 


9 


LG:1094571.1:2000MAY19 


767 


853 


forward 2 


TM 


Nin 


9 


LG:1094571.1:2000MAY19 


756 


842 


forward 3 


TM 


Nin 


10 


LI:350754.4:2000MAY01 


277 


348 


forward 1 


TM 


Nin 


10 


LI:350754.4:2000MAY01 


583 


651 


forward 1 


TM 


Nin 


10 


LI:350754.4:2000MAY01 


670 


747 


forward 1 


TM 


Nin 


10 


LI:350754.4:2000MAY01 


381 


467 


forward 3 


TM 


Nin 


10 


LI:350754.4:2000MAY01 


2469 


2555 


forward 3 


TM 


Nin 


12 


Ll:11 90263.1 :2000MAY01 


664 


735 


forward 1 


TM 


N in 


12 


Ll:1 190263.1 2000MAY01 


787 


861 


forward 1 


TM 


Nin 


12 


Ll:11 90263.1 :2000MAY01 


901 


954 


forward 1 


TM 


Nin 


12 


Ll:1 1 90263.1 :2000MAY01 


188 


274 


forward 2 


TM 


Nin 


12 


Ll:11 90263.1 :2000MAY01 


455 


508 


forward 2 


TM 


Nin 


12 


Ll:1 190263.1 :2000MAY01 


809 


895 


forward 2 


TM 


Nin 


12 


Ll:1 190263.1 :2000MAY01 


1616 


1663 


forward 2 


TM 


Nin 


12 


Ll:1 190263.1 :2000MAY01 


183 


251 


forward 3 


TM 


Nin 


12 


LI:1190263.1:2000MAY01 


648 


704 


forward 3 


TM 


Nin 


12 


Ll:1 190263.1 :2000MAY01 


1149 


1235 


forward 3 


TM 


Nin 


13 


LG.27091 6.2.-2000FEB1 8 


173 


259 


forward 2 


TM 


Nout 


14 


LG:99941 4.3:2000FEB1 8 


109 


195 


forward 1 


TM 


Nout 


14 


LG.99941 4.3:2000FEB1 8 


358 


438 


forward 1 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


520 


591 


forward 1 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


661 


744 


forward 1 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


883 


969 


forward 1 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


976 


1062 


forward 1 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


302 


388 


forward 2 


TM 


Nin 


14 


LG.99941 4.3:2000FEB1 8 


533 


613 


forward 2 


TM 


Nin 


14 


LG:999414.3:2000FEB18 


992 


1048 


forward 2 


TM 


Nin 


14 


LG:999414.3:2000FEB18 


1169 


1246 


forward 2 


TM 


Nin 


14 


LG:999414.3:2000FEB18 


1307 


1366 


forward 2 


TM 


Nin 


14 


LG:999414.3:2000FEB18 


207 


284 


forward 3 


TM 


Nout 


14 


LG.99941 4.3:2000 FEB1 8 


324 


404 


forward 3 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


540 


599 


forward 3 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


1029 


1115 


forward 3 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


1167 


1253 


forward 3 


TM 


Nout 


14 


LG:999414.3:2000FEB18 


1314 


1373 


forward 3 


TM 


Nout 


15 


LG:429446.1:2000FEB18 


628 


699 


forward 1 


TM 


Nout 


15 


LG:429446.1:2000FEB18 


629 


682 


forward 2 


TM 


Nin 


15 


LG:429446.1 :2000FEB1 8 


627 


713 


forward 3 


TM 


N in 


16 


LI:057229.1:2000FEB01 


10 


69 


forward 1 


TM 




16 


Ll:057229.1 :2000FEB01 


118 


198 


forward 1 


TM 




16 


LI:057229.1:2000FEB01 


292 


360 


forward 1 


TM 




16 


LI:057229.1:2000FEB01 


11 


67 


forward 2 


TM 




16 


LI:057229.1:2000FEB01 


146 


226 


forward 2 


TM 




16 


LI:057229.1:2000FEB01 


290 


355 


forward 2 


TM 




16 


LI:057229.1:2000FEB01 


12 


71 


forward 3 


TM 


Nout 




— 

> 


~95 




i 
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16 


LI:057229.1:2000FEB01 


114 


176 


forward 3 


TM 


Nout 


17 


Ll:351 965.1. 2000FEB01 


487 


573 


forward 1 


TM 




17 


Ll:351 965.1 :2000FEB01 


1036 


1098 


forward 1 


TM 




17 


Ll:351 965.1 :2000FEB01 


492 


578 


forward 3 


TM 


Nin 


17 


Ll:351 965.1 :2000FEB01 


969 


1055 


forward 3 


TM 


Nin 


17 


Ll:351 965.1 :2000FEB01 


1098 


1184 


forward 3 


TM 


Nin 


18 


LG:068682.1:2000FEB18 


707 


793 


forward 2 


TM 


Nout 


19 


LG:242665.1:2000FEB18 


10 


63 


forward 1 


TM 


Nout 


19 


LG:242665.1:2000FEB18 


12 


62 


forward 3 


TM 


Nout 


19 


LG:242665.1:2000FEB18 


333 


398 


forward 3 


TM 


Nout 


20 


LG:241743.1:2000FEB18 


43 


99 


forward 1 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


1300 


1365 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


1570 


1647 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


2386 


2472 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


2533 


2598 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


2620 


2706 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


2740 


2826 


forward 1 


TM 


Nin 


21 


LI:034212.1:2000FEB01 


719 


805 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


1205 


1291 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


1460 


1546 


• forward 2 


TM 




21 


LI:034212.1:2000FEB01 


1685 


1768 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


1814 


1882 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


2066 


2128 


forwards 


TM 




21 


LI:034212.1:2000FEB01 


2156 


2218 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


2540 


2626 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


2657 


2734 


forward 2 


TM 




21 


LI:034212.1:2000FEB01 


12 


62 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


1236 


1301 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


1590 


1646 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


1668 


1721 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


2130 


2216 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


2295 


2381 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


2436 


2513 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


2538 


2624 


forward 3 


TM 


Nout 


21 


LI:034212.1:2000FEB01 


2667 


2735 


forward 3 


TM 


Nout 


22 


LG:344886.1:2000MAY19 


937 


1002 


forward 1 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


1081 


1155 


forward 1 


TM 


Nin 


22 


LG:344886.1 :2000MAY1 9 


1696 


1782 


forward 1 


TM 


Nin 


22 


LG:344886.1:2000MAY19 


413 


463 


forward 2 


TM 


Nin 


22 


LG:344886.1:2000MAY19 


551 


637 


forward 2 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


950 


1012 


forward 2 


TM 


Nin 


22 


LG:344886.1:2000MAY19 


1031 


1093 


forward 2 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


1112 


1183 


forward 2 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


1271 


1348 


forward 2 


TM 


Nin 


22 


LG:344886,1 :2000MAY1 9 


1634 


1720 


forward 2 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


567 


626 


forward 3 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


1011 


1073 


forward 3 


TM 


Nin 


22 


LG:344886.1:2000MAY19 


1089 


1151 


forward 3 


TM 


Nin 


22 


LG:344886.1 :2000MAY19 


1707 


1757 


forward 3 


TM 


Nin 


CO 




111 
1 1 1 


1fi7 
ID/ 


lUlWdlU O 


TM 


W in 

IN 111 


24 


LG:338927.1:20Q0MAY19 


934 


1020 


forward 1 


TM 


Nout 


24 


LG:338927.1:2000MAY19 


1133 


1219 


forward 2 


TM 


Nin 


24 


LG:33£927.1 :2000MAY1 9 


1170 


1250 


forward 3 


TM 


Nin 


25 


LG:898771.1:2000MAY19 


1261 


1314 


forward 1 


TM 


Nout 


25 


LG:898771.1:2000MAY19 


1397 


1450 


forward 2 


TM 


Nout 


26 


LI:257664.67:2000MAY01 


280 


366 


forward 1 


TM 


Nin 




i 


196 


I 
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26 


LI:257664.67:2000MAY01 


421 


26 


LI:257664.67:2000MAY01 


12 


27 


LI:001496.2:2000MAY01 


399 


28 


Ll:1 085273.2:2000MAY01 


2188 


28 


LI:1085273.2:2000MAY01 


503 


28 


Ll:1 085273.2:2000MAY01 


2126 


28 


LI:1085273.2:2000MAY01 


897 


29 


LI:333'138.25000MAY01 


1930 


29 


LI:333138.2:2000MAY01 


50 


29 


LI:333138.2:2000MAY01 


884 


29 


LI:333138.2:2000MAY01 


114 


29 


LI:333138.2:2000MAY01 


273 


29 


LI:333138.2:2000MAY01 


819 


29 


LI:333138.2:2000MAY01 


1581 


30 


LI:338927.1:2000MAY01 


1069 


30 


LI:338927.1:2000MAY01 


968 


30 


LI:338927.1:2000MAY01 


1056 


30 


LI:338927.1:2000MAY01 


1155 


31 


LG:335558.1:2000FEB18 


518 


31 


LG:335558.1 :2000FEB1 8 


614 


31 


LG:335558.1:2000FEB18 


761 


31 


LG:335558.1:2000FEB18 


798 


31 


LG:335558.1:2000FEB18 


882 


31 


LG:335558.1:2000FEB18 


966 


32 


LG:998283.7:2000FEB18 


1066 


32 


LG:998283.7:2000FEB18 


23 


32 


LG:998283.7:2000FEB18 


194 


32 


LG:998283.7:2000FEB1 8 


392 


32 


LG:998283.7:2000FEB18 


527 


32 


LG:998283.7:2000FEB18 


776 


32 


LG:998283.7:2000FEB1 8 


1064 


32 


LG:998283.7:2000FEB1 8 


12 


32 


LG:998283.7:2000FEB18 


147 


32 


LG:998283.7:2000FEB18 


684 


32 


LG:998283.7:2000FEB18 


1011 


33 


LI:402739.1:2000FEB01 


415 


35 


LG:981076.2:2000MAY19 


388 


35 


LG:981076.2:2000MAY19 


20 


35 


LG:981076.2:2000MAY19 


389 


35 


LG:981076.2:2000MAY19 


464 


35 


LG:981076.25000MAY19 


539 


35 


LG.981 076.2:2000MAY1 9 


438 


37 


LI:1190250.1:2000MAY01 


530 


37 


U:11 90250.1 :2000MAY01 


558 


38 


LG:021371.3:2000FEB18 


122 


41 


LG:41 0726.1 :2000FEB1 8 


22 


41 


LG:41 0726.1 :2000FEB1 8 


385 


42 


LG:200005.1:2000FEB18 


166 


42 


LG:200005.1:2000FEB18 


185 


42 


LG.200005.1 :2000FEB1 8 


162 


46 


LG:1079203.1:2000FEB18 


11 


46 


LG:1 079203.1 :2000FEB1 8 


125 


46 


LG:1079203.1:2000FEB18 


965 


47 


LG:1 082586.1 :2000FEB1 8 


256 


47 


LG: 1 082586.1 :2000FEB1 8 


248 


49 


LG:1 08,2775.1 2000FEB1 8 


553 



197 



PCT/US01/06059 



498 


forward 1 


TM 


Nin 


71 


forward 3 


TM 


Nout 


473 


forward 3 


TM 




2274 


forward 1 


TM 


Nin 


583 


forward 2 


TM 


Nout 


2194 


forward 2 


TM 


Nout 


968 


forward 3 


TM 


Nin 


2016 


forward 1 


TM 


Nout 


103 


forward 2 


TM 




940 


forward 2 


TM 




179 


forward 3 


TM 


Nout 


356 


forward 3 


TM 


Nout 


875 


forward 3 


TM 


Nout 


1667 


forward 3 


TM 


Nout 


1140 


forward 1 


TM 


Nin 


1051 


forward 2 


TM 


Nin 


1118 


forward 3 


TM 


Nout 


1217 


forward 3 


TM 


Nout 


604 


forward 2 


TM 


Nin 


682 


forward 2 


TM 


Nin 


829 


forward 2 


TM 


Nin 


860 


forward 3 


TM 


Nin 


944 


forward 3 


TM 


Nin 


1028 


forward 3 


TM 


Nin 


1146 


forward 1 


TM 


Nin 


109 


forward 2 


TM 


Nin 


280 


forward 2 


TM 


Nin 


478 


forward 2 


TM 


Nin 


613 


forward 2 


TM 


Nin 


862 


forward 2 


TM 


Nin 


1141 


forward 2 


TM 


Nin 


65 


forward 3 


TM 


Nin 


227 


forward 3 


TM 


Nin 


770 


forward 3 


TM 


Nin 


1097 


forward 3 


TM 


Nin 


501 


forward 1 


TM 


Nin 


450 


forward 1 


TM 


Nin 


82 


forward 2 


TM 


Nout 


451 


forward 2 


TM 


Nout 


526 


forward 2 


TM 


Nout 


604 


forward 2 


TM 


Nout 


524 


forward 3 


TM 


Nin 


613 


forward 2 


TM 




635 


forward 3 


TM 


Nout 


208 


forward 2 


TM 


Nin 


108 


forward 1 


TM 


Nin 


471 


forward 1 


TM 


Nin 


222 


forward 1 


TM 


Nout 


232 


forward 2 


TM 


Nout 


248 


forward 3 


TM 


Nout 


70 


forward 2 


TM 


Nin 


196 


forward 2 


TM 


Nin 


1051 


forward 2 


TM 


Nin 


339 


forward 1 


TM 


Nin 


316 


forward 2 


TM 


Nout 


606 


forward 1 


TM 


Nin 
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50 


LG:108312O.1:2000FEB18 


214 


50 


LG:1 083120.1 :2000FEB18 


233 


50 
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TABLE 5 
SEQ ID NO: Template ID 

1 LG:1040582.1:2000FEB18 

2 LG:453570.1:2000FEB18 

3 LG:408751 .3:2000FEB1 8 

4 LI:090574.t:2000FEB01 

5 LI:229932.2:2000FEB01 

6 LI:332176.1:2000FEB01 

7 LI:403248.2:2000FEB01 

8 LG:220992.1 :2000MAY19 

9 LG: 1 094571 .1 :2000MAY1 9 

1 0 LI:350754.4:2000MAY01 

1 1 LI:255828.29:2000MAY01 

1 2 U:1 1 90263.1 :2000MAY01 

1 3 LG:27091 6.2:2000FEB1 8 



1 4 LG:99941 4.3:2000FEB1 8 

1 5 LG:429446.1 :2000FEB1 8 

16 LI:057229.1:2000FEB01 

1 7 Ll:351 965.1 :2000FEB01 

1 8 LG:068682.1 :2000FEB1 8 

19 LG:242665.1:2000FEB18 

20 LG:241 743.1 :2000FEB1 8 

21 Ll:03421 2.1 :2000FEB01 

22 LG:344886.1 :2000MAY1 9 

23 LG:228930.1 :2000MAY19 

24 LG:338927.1 :2000MAY19 

25 LG:898771 .1 :2000MAY1 9 

26 LI:257664.67:2000MAY01 

27 LI:001496.2:2000MAY01 

28 LI:1085273.2:2000MAY01 

29 Ll:3331 38.2:2000MAY01 

30 Ll:338927.1 :2000MAY01 

31 LG:335558.1 :2000FEB18 



Tissue Distribution . 

Uver - 41%, Pancreas - 34%, Cardiovascular System 
-14% 

Nervous System - 100% 

Sense Organs - 63%, Nervous System - 22% 

Nervous System - 46%, Unclassified/Mixed - 36% 

Musculoskeletal System - 80% 

Urinary Tract -95% 

Respiratory System - 60%, Hemic and Immune 
System -40% 

Embryonic Structures - 17%, Male Genitalia - 12% 

Uver - 1 9%, Embryonic Structures - 1 6%, 

Cardiovascular System - 14% 

Skin - 47%, Stomatognathic System - 27%, Sense 

Oraans-14% 

Musculoskeletal System - 100% 

Urinary Tract - 80%, Urinary Tract - 1 5% 

Female Genitalia - 100% 

Embryonic Structures - 30%, Urinary Tract - 1 3%, 

Digestive System - 1 1%, Musculoskeletal System - 

11% 

Urinary Tract - 80%, Hemic and Immune System - 
20% 

Male Genitalia - 71%, Hemic and Immune System - 
29% 

Unclassified/Mixed - 53%, Male Genitalia - 12% 
Unclassified/Mixed - 49%, Male Genitalia - 27% 
Germ Cells - 47%, Female Genitalia - 13%, Male 
Genitalia -12% 

Uver - 27%, Urinary Tract - 27%, Respiratory System - 
14% 

Digestive System - 24%, Musculoskeletal System - 

22%, Nervous System - 1 1 % 

Germ Cells - 24%, Nervous System - 12% 

Embryonic Structures - 43%, Nervous System - 29%, 

Respiratory System - 14%, Hemic and immune 

Svstem-14% 

Digestive System - 23%, Unclassified/Mixed - 21%, 
Embryonic Structures - 19%, Hemic and Immune 
Svstem-19% 

Pancreas - 13%, Embryonic Structures - 1 1%, 
Female Genitalia - 10%, Urinary Tract - 10%, Hemic 
and Immune System - 10%, Cardiovascular System - 
10% 

Hemic and Immune System - 100% 

Endocrine System - 27%, Female Genitalia - 25%, 

Embryonic Structures - 25% 

Digestive System - 29%, Skin - 24%, Endocrine 

System - 16% 

Exocrine Glands - 61%, Nervous System - 13%, 
Nervous System - 11% 

Embryonic Structures - 51%, Digestive System - 17% 

Endocrine System - 45%, Nervous System - 1 8%, 
Exocrine Glands - 11% 
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32 LG:998283.7:2000FEB1 8 Sense Organs - 33%, Germ Cells - 1 8% 

Unclassified/Mixed - 78%, Male Genitalia -11%, 
33- L!:402739.1:2000FEB01 Hemic and Immune System - 1 1% 

34 LL175223.1 :2000FEB01 Embryonic Structures - 99% 

Endocrine System - 28%, Nervous System - 22%, 
Respiratory System - 17%, Female Genitalia - 17%, 

35 LG:981076.2:2000MAY19 Hemic and Immune Svstem - 17% 

36 Ll:1 008973.1 2000MAY01 Nervous System - 57%, Digestive System - 41 % 

37 Ll:1 1 90250.1 :2000MAY01 Female Genitalia - 48%, Respiratory System - 25% 

Liver - 23%, Endocrine System - 17%, Hemic and 

38 LG:021371 .3:2000FEB18 Immune System - 1 7% 

39 LG:475404.1:2000FEB18 Skin -82% 

Liver - 46%, Connective Tissue - 31%, Nervous 

40 LG:979406.2:2000FEB18 System -15% 

Embryonic Structures - 52%, Endocrine System - 

41 LG:41 0726.1 :2000FEB1 8 26% 

Unclassified/Mixed - 26%, Cardiovascular System - 

42 LG:200005.1:2000FEB18 14%, Female Genitalia - 13% 

43 LG:1076828.1:2000FEB18 Unclassified/Mixed - 69%, Urinary Tract - 25% 

Unclassified/Mixed - 63%, Musculoskeletal System - 

44 LG:1 076931. 1:2000FEB1 8 20%, Urinary Tract- 1 1% 

45 LG:1 0781 21 .1 :2000FEB1 8 Female Genitalia - 75%, Nervous System - 25% 

Female Genitalia - 42%, Cardiovascular System - 

46 LG:1 079203.1 :2000FEB1 8 33%, Hemic and Immune System - 1 7% 

47 LG:1 082586.1 :2000FEB1 8 Respiratory System- 100% 

48 LG:1 082774.1 :2000FEB1 8 Respiratory System - 50%, Female Genitalia - 50% 

49 LG:1 082775.1 :2000FEB1 8 Female Genitalia - 75%, Nervous System - 25% 

50 LG: 10831 20.1 :2000FEB1 8 Nervous System- 100% 

51 LG:1 087707.1 :2000FEB1 8 Stomatognathlc System - 98% 

Embryonic Structures - 44%, Connective Tissue - 

52 LG:1090915.1:2000FEB18 19% 

53 LG:1094230.1:2000FEB18 Female Genitalia - 100% 

Connective Tissue - 44%, Exocrine Glands - 44%, 

54 LG:474848.3:2000FEB1 8 Hemic and Immune Svstem - 1 1 % 

Nervous System - 38%, Digestive System - 38%, Male 

55 Ll:251 656.1 :2000FEB01 Genitalia - 25% 

Hemic and Immune System - 69%, Endocrine 

56 LU021 371 .1 :2000FEB01 System - 1 4% 

57 Ll:1 33095.1 :2000FEB01 Respiratory System - 67%, Nervous System - 1 3% 

Unclassified/Mixed - 30%, Respiratory System - 19%, 
Nervous System - 13%, Digestive System - 13% 

58 LI:236654.2:2000FEB01 

Unclasslfled/MIxed - 37%, Urinary Tract - 16%, 

59 Ll:200009.1 :2000FEB01 Cardiovascular System - 1 5% 

Unclassified/Mixed - 78%, Musculoskeletal System - 

60 LI:758502.1:2000FEB01 22% 

Nervous System - 56%, Skin - 27%, Connective Tissue 

61 Ll:344772.1 :2000FEB01 - 13% 

62 Ll:789445.1 :2000FEB01 Endocrine System - 1 00% 

Urinary Tract - 31%, Female Genitalia - 19%, 
Digestive System - 19%, Hemic and Immune System 

63 Ll:789657.1 :2000FEB01 - )9% 

Exocrine Glands - 44%, Female Genitalia - 33%, 
64' Ll:789808.1 :2000FEB01 Nervous System - 22% 
65 LI:792919.1:2000FEB01 Respiratory System- 100% 
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Female Genitalia - 42%, Endocrine System - 19%, 

66 Ll:793949.1 :2000FEB01 Exocrine Glands - 1 3% 

67 U.-794389.1 :2000FEB01 Endocrine System - 1 00% 

68 LI:796010.1:2000FEB01 Exocrine Glands - 100% 

69 Ll:796324.1 :2000FEB01 Female Genitalia - 100% 

70 Li:796373.1:2000FEB01 Respiratory System - 100% 

71 LI79641 5.1 :2000FEB01 Nervous System - 1 00% 

72 Ll:798636.1 :2000FEB01 Hemic and Immune System - 100% 

73 Ll:800045. 1 :2000FEB01 Female Genitalia - 60%, Male Genitalia - 40% 

74 Ll:800680.1 :2000FEB01 Cardiovascular System - 100% 

75 Ll:800894.1 :2000FEB01 Respiratory System - 50%, Digestive System - 50% 

76 LI:801015.1:2000FEB01 Male Genitalia - 100% 

77 Ll:801 236.1 :2000FEB01 Endocrine System - 100% 

78 Ll:803335.1 :2000FEB01 Connective Tissue - 100% 

Nervous System - 38%, Digestive System - 38%, Male 

79 LL803998.1 :2000FEB01 Genitalia - 25% 

80 LL478757. 1 :2000FEB01 Digestive System - 1 00% 

81 Ll:808532.1 :2000FEB01 Hemic and Immune System - 100% 

82 Li:443073.1 :2000FEB01 Digestive System - 100% 

Exocrine Glands - 80%, Hemic and Immune System 

83 Ll:479671 .1 :2000FEB01 - 20% 

84 Ll;81 0078.1 :2000FEB01 Digestive System - 1 00% 

85 Ll:81 0224.1 :2000FEB01 Digestive System - 1 00% 

Nervous System - 24%, Unclassified/Mixed - 18%, 

86 LI:817052.2:2000FEB01 Exocrine Glands - 1 4% 

Embryonic Structures - 63%, Digestive System - 30% 

87 LG:892274.1:2000MAY19 

Digestive System - 40%, Respiratory System - 30%, 

88 LG;1 080959.1 :2000MAY1 9 Hemic and Immune System - 30% 

89 LG: 1 054900.1 :2000MAY1 9 Digestive System - 1 00% 

Nervous System - 38%, Female Genitalia - 38% 

90 LG:1077357.1 :2000MAY19 Male Genitalia - 25% 

Pancreas - 31%, Digestive System - 22%, Hemic and 

91 LG:1084051.1:2000MAY19 Immune System- 16% 

Female Genitalia - 23%, Unclassified/Mixed - 23%, 
Cardiovascular System - 18%, Exocrine Glands - 

92 LG:1 076853.1 :2000MAY1 9 18% 

Female Genitalia - 22%, Nervous System - 1 7%, 

93 LG:481631.10:2000MAY19 Exocrine Glands - 1 7%, Urinary Tract- 17% 

Exocrine Glands - 67%, Cardiovascular System - 

94 LG:1 088431 .2:2000MAY1 9 33% 

Endocrine System - 18%, Embryonic Structures - 

95 LI:401619.10:2000MAY01 16%, Pancreas- 15% 

Hemic and Immune System - 27%, Female 

96 Ll:11 44007.1 :2000MAY01 Genitalia - 13% 

Endocrine System - 28%, Sense Organs - 22%, 

97 Ll:331 074.1 :2000MAY01 Connective Tissue - 10% 

98 Ll:1 170349.1 :2000MAY01 Endocrine System - 91 % 

Embryonic Structures - 24%, Musculoskeletal System 

99 LG:335097.1 :2000FEB18 - 19%, Nervous System - 16% 

100 LG:1 076451. 1:2000FEB1 8 Nervous System - 100% 

101 Ll:805478.1 :2000FEB01 Skin - 100% 

Endocrine System - 33%, Embryonic Structures - 

102 LG:101269.1:2000MAY19 33%, Urinary Tract- 30% 

1 03 Ll:331 087.1 :2000MAY01 Liver - 82%, Hemic and Immune System - 1 3% 



WO 01/62927 



PCT/US01/06059 



Cardiovascular System - 81%, Cardiovascular 

1 04 11:41 01 88.1 :2000MAY01 System - 1 2% 

1 05 U:1 1 88288.1 :2000MAY01 Nervous System - 73% 

Uver- 16%, Male Genitalia - 13%, Embryonic 

106 L!:427997.4:2000MAY01 Structures- 11% 

1 07 LG:451 682.1 :2000FEB1 8 Nen/ous System - 1 00% 

1 08 LG:1 077283.1 :2000FEB1 8 Uver - 86%, Hemic and Immune System - 1 4% 

Embryonic Structures - 41 %, Endocrine System - 

109 LG:481436.5:2000FEB18 20%, Hemic and Immune System - 13% 

Endocrine System - 43%, Urinary Tract - 36%, 

1 1 0 Ll:793701 .1 :2000FEB01 Respiratory System - 21 % 

1 1 1 Ll:373637.1 :2000FEB01 Germ Cells - 74%, Unclassified/Mixed - 1 6% 

Digestive System - 43%, Male Genitalia - 24%, 

112 LG:239368.2:2000MAY19 Endocrine System - 24% 

Germ Cells - 66%, Unclasslfled/Mixed - 22%, Male 

113 LI:053826.1:2000MAY01 Genitalia- 12% 

1 1 4 Ll:449393.1 :2000MAY01 Nervous System - 1 00% 

115 LI:1071427.96:2000MAY01 Stomatognathic System - 13% 

1 1 6 LI:336338.8:2000MAY01 Unclassified/Mixed - 55%, Connective Tissue - 26% 

Urinary Tract - 24%, Hemic and Immune System - 

117 LG:345527.1:2000FEB18 24%, Respiratory System - 18% 

1 1 8 LG:1 089383.1 :2000FEB1 8 Connective Tissue - 73%, Female Genitalia - 27% 

Female Genitalia - 38%, Exocrine Glands - 31%, 
Male Genitalia - 15%, Hemic and Immune System - 

119 LG:1092522.1:2000FEB18 15% 

1 20 LG:1 09321 6.1 :2000FEB1 8 Urinary Tract - 1 00% 

Embryonic Structures - 86%, Hemic and Immune 
System -14% 

Unclassified/Mixed - 34%, Hemic and Immune 
System - 20%, Urinary Tract - 1 7% 
Nervous System - 62%, Urinary Tract - 38% 
Female Genitalia - 100% 
Endocrine System - 100% 
Digestive System - 75% 
Embryonic Structures - 28%, Skin - 20%, 
Unclassified/Mixed -16% 
Cardiovascular System - 33%, Endocrine System - 

1 28 LG:247388.1 :2000MAY1 9 21%, Male Genitalia - 21% 

129 LG:255789.10:2000MAY19 Endocrine System - 56%, Urinary Tract -44% 

Endocrine System - 22%, Digestive System - 13%, 

1 30 Ll:78761 8.1 :2000MAY01 Endocrine System - 1 2% 

Sense Organs - 18%. Nervous System -11%: 



1 21 Ll:27031 8.3:2000FEB01 

1 22 Ll:335671 .2:2000FEB01 

1 23 Ll:793758.1 :2000FEB01 

1 24 Ll:80371 8.1 :2000FEB01 

1 25 Ll:41 21 79.1 :2000FEB01 

1 26 Ll:81 5679.1 :2000FEB01 

1 27 Ll:481 361 .3:2000FEB01 
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Digestive System - 34%, Liver - 1 7%, Female 

139 LG:337818.2:2000FEB18 Genitalia -11% 

Digestive System - 27%, Liver - 19%, Female 

140 LI:337818.1:2000FEB01 Genitalia - 15% 

Pancreas - 48%, Endocrine System - 24%, 

141 LG:241577.4:2000MAY19 Respiratory System- 14% 

Respiratory System - 67%, Digestive System - 22%, 

142 LG:344786.4:2000MAY19 Nervous System - 1 1% 

Endocrine System - 44%, Unclasslfled/MIxed - 1 7%, 

143 LI:414307.1:2000FEB01 Nervous System - 1 1% 

144 LI:202943.2:2000FEB01 Embryonic Structures - 100% 

145 LI:246194.2:2000FEB01 Germ Cells - 75%, Pancreas - 13% 

146 LI:815961.1:2000FEB01 Digestive System - 99% 

Skin - 33%, Embryonic Structures - 21%, Digestive 

147 LG:120744.1:2000MAY19 System -21% 

Musculoskeletal System - 45%, Cardiovascular 

148 U:757520.1 :2000MAY01 System - 26%, Skin - 24% 

149 LG:160570.1:2000FEB18 Skin - 84%, Female Genitalia - 16% 

Male Genitalia - 50%, Hemic and Immune Svstem - 

150 LI:350398.3:2000FEB01 50% 

151 LI:221285.1:2000FEB01 Endocrine System - 42%, Nervous System - 21% 

153 LI:329017.1:2000FEB01 Endocrine System - 62%, Unclassified/Mixed - 24% 

154 Ll:401322.1 :2000FEB01 Sense Organs - 44%, Uver - 22%, Skin - 14% 

Respiratory System - 18%, Female Genitalia - 16%, 

155 LG:403409.1:2000MAY19 Cardiovascular System - 13% 

1 56 LG:233933.5:2000MAY1 9 Digestive System - 1 00% 

Connective Tissue - 40%, Nervous System - 1 9%, 

157 LI:290344.1:2000MAY01 Embryonic Structures - 12% 

158 Ll:41 0742.1 :2000MAY01 Respiratory System - 47%, Skln-42% 

Stomatognathlc System - 57%, Musculoskeletal 

159 LG:406568.1:2000MAY19 System-21%, Cardiovascular System- 16% 

1 60 Ll:283762.1 20Q0MAY01 Sense Organs - 25% 

1 61 Ll:347687.1 1 3:2000MAY01 Nervous System - 45%, Nervous System - 38% 

162 LI:1146510.1:2000MAY01 Skin -94% 

163 LG:451710.1:2000FEB18 Connective Tissue - 89%, Nervous System - 1 1% 

164 LG:455771 .1 :2000FEB1 8 Nervous System - 100% 

165 LG:452089.1:2000FEB18 Nervous System - 100% 

166 LG:246415.1:2000FEB18 Pancreas - 83%, Nervous System - 17% 

Cardiovascular System - 17%, Connective Tissue - 

167 LG:414144.10:2000FEB18 12% 

168 LG:1101445.1:2000FEB18 Uver-91% 

Hemic and Immune System - 64%, Male Genitalia - 

169 LG:452134.1:2000FEB16 36% 

1 70 Ll:903021 .1 :2000FEB01 Male Genitalia - 1 00% 

171 Ll:246422.1 :2000FEB01 Hemic and Immune System - 100% 

1 72 LG:449404. 1 :2000MAY1 9 Nervous System - 1 00% 

173 LG:449413.1:2000MAY19 Nervous System- 100% 

174 LG:450105.1:2000MAY19 Nervous System- 100% 

1 75 LG:460809.1 :2000M AY1 9 Exocrine Glands - 1 00% 

176 LG:481781.1:2000MAY19 Nervous System- 100% 

177 LG:1101153.1:2000MAY19 Nervous System- 100% 

Exocrine Glands - 28%, Endocrine System - 1 9%, 

1 78 LI:257695.20:2000MAY01 Nervous System - 1 6%, Diaestive System - 1 6% 

179 LI:455771.1:2000MAY01 Nervous System- 100% 

Nervous System - 60%, Hemic and Immune System - 

180 LI:274551.1:2000MAY01 40% 
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Embryonic Structures - 58%, Digestive System - 26%, 

1 81 Ll:035973.1 :2000MAY01 Nervous System - 1 6% 

182 LG:978427.5:2000FEB18 Nervous System- 100% 

183 LG:247781.2:2000FEB18 Nervous System- 11% 

1 84 Ll:034583.1 :2000FEB01 Nervous System - 35%, Endocrine System - 35% 

Cardiovascular System - 28%, Urinary Tract - 27%, 

185 LI:333307.2:2000FEB01 Musculoskeletal System - 17% 

186 LI:814710.2:2000FEB01 Respiratory System - 100% 

187 LG:414732.1:2000MAY19 Endocrine System - 82%, Nervous System - 18% 

Connective Tissue - 55%, Nervous System - 1 5%, 

188 LG:413910.6:2000MAY19 Embryonic Structures - 13% 

189 LI:414732.2:2000MAY01 Endocrine System - 80%, Nervous System - 20% 

190 LI:900264.2:2000MAY01 Urinary Tract - 15%, Male Genitalia- 12% 

Urinary Tract - 46%, Endocrine System - 1 7%, Germ 

1 91 Ll:335593.1 :2000MAY01 Cells - 14% 

Stomatognathic System - 35%, Digestive System - 

192 Ll:11 89543.1 :2000MAY01 14% 

1 93 LG:455450.1 :2000FEB1 8 Nervous System - 1 00% 

194 . LG:1040978.1:2000FEB18 Nervous System - 1 00% 

1 95 LG:446649.1 :2000FEB1 8 Uver - 80%, Hemic and Immune System - 1 3% 

Unclasslfied/Mbced - 17%, Sense Organs - 16%, 

196 LG:132147.3:2000FEB18 Embryonic Structures - 1 0% 

1 97 LL036034.1 :2000FEB01 Nervous System - 80% 

Unclassified/Mixed - 53%, Cardiovascular System - 

198 LG:162161.1:2000MAY19 21%, Nervous System - 16% 

Unclassified/Mixed - 40%, Respiratory System - 24%, 

199 LG:407214.10:2000MAY19 Cardiovascular System - 16% 

Digestive System - 41%, Exocrine Glands - 24%, 

200 LG:204626.1 :2000MAY1 9 Female Genitalia - 1 8% 

Unclassified/Mixed - 31 %, Nervous System - 25%, 

201 L!:007401 .1 :2000MAY01 Urinary Tract - 1 1 % 

202 LU476342.1 :2000MAY01 Connective Tissue - 77%, Nervous System - 23% 

Hemic and Immune System - 27%, Musculoskeletal 

203 Ll:1 072759.1 :2000MAY01 System - 1 9%, Endocrine System - 1 1 % 

204 LG:998857.1 :2000FEB1 8 Digestive System - 58%, Pancreas - 12% 

205 LG:482261.1:2000FEB18 Male Genitalia - 85%, Respiratory System - 15% 

Skin - 20%, Germ Cells - 18%, Female Genitalia - 

206 LG:480328.1:2000FEB18 10% 

Germ Celis - 44%, Digestive System - 15%, Male 

207 LG:311197.1:2000MAY19 Genitalia - 1 1% 

208 LG:1 054883.1 :2000MAY1 9 Endocrine System - 1 00% 

209 LG:399395.1:2000MAY19 Hemic and Immune System - 100% 

Germ Cells - 23%, Exocrine Glands - 14%, 

21 0 LG:380497.2:2000MAY1 9 Connective Tissue - 1 3% 

21 1 Ll:27291 3.22:2000MAY01 Female Genitalia - 100% 
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CLAIMS 

What Is claimed is: 

1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
5 consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l-211, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-211, 

c) a polynucleotide sequence complementary to a), 

10 d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a) through d). 

2. An isolated polynucleotide of claim 1 , comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO: 1-21 1 . 

15 

3. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide 
of claim 1. 

4. A composition for the detection of expression of diagnostic and therapeutic polynucleotides 

2 o comprising at least one of the polynucleotides of claim 1 and a detectable label. 

5. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
25 amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof , and, optionally, if present, the amount thereof. 

6. A method for detecting a target polynucleotide in a sample, said target polynucleotide 

3 o comprising a sequence of a polynucleotide of claim 1 , the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 
the amount thereof. 

7. A method of claim 5, wherein the probe comprises at least 30 contiguous nucleotides. 

8. A method of claim 5, wherein the probe comprises at least 60 contiguous nucleotides. 

9. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 1 . 

10. A cell transformed with a recombinant polynucleotide of claim 9. 

11. A transgenic organism comprising a recombinant polynucleotide of claim 9. 

12. A method for producing a diagnostic and therapeutic polypeptide, the method comprising: 

a) culturing a cell under conditions suitable for expression of the diagnostic and therapeutic 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide of claim 9, and 

b) recovering the diagnostic and therapeutic polypeptide so expressed 

1 3. A purified diagnostic and therapeutic polypeptide (DITHP) encoded by at least one of the 
polynucleotides of claim 2. 

14. An isolated antibody which specifically binds to a diagnostic and therapeutic polypeptide 
of claim 13. 

15. A method of identifying a test compound which specifically binds to the diagnostic and 
therapeutic polypeptide of claim 13, the method comprising the steps of : 

a) providing a test compound; 

b) combining the diagnostic and therapeutic polypeptide with the test compound for a 
sufficient time and under suitable conditions for binding; and 

c) detecting binding of the diagnostic and therapeutic polypeptide to the test compound, 
thereby identifying the test compound which specifically binds the diagnostic and therapeutic 
polypeptide. 
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1 6. A microarray wherein at least one element of the microairay is a polynucleotide of claim 3. 



17. A method for generating a transcript image of a sarqple which contains polynucleotides, 
5 the method comprising the steps of: 

a) labding the polynucleotides of the sample, 

b) contacting the dements of the microarray of claim 16 with the labeled polynucleotides of the 
sample under conditions suitable for the formation of a hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample, 

10 

18, A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a polynudeotide sequence of claim 1 , the 
method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
1 5 suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide* and 

. c) comparing the expression of the target polynucleotide in the presence of varying amounts of 
the compound and in the absence of the compound. 



20 19. A method for assessing toxidty of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic adds of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of daim 1 unto conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 

2 5 sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 1 
or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 

30 of hybridization complex in the treated biological sample is indicative of toxidty of the test compound. 



20. An array comprising different nucleotide molecules affixed in distinct physical locations on 
a solid substrate, wherein at least one of said nudeotide molecules comprises a first oligonudeotide or 
polynucleotide sequence specifically hybridizable with at least 30 contiguous nudeotides of a target 
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polynucleotide, said target polynucleotide having a sequence of claim 1 . 



21 . An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

5 

22. An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide 

23. Anarray of claim20, which is a microarray. 

10 

24. An array of claim 20, further comprising said target polynucleotide hybridized to said first 
oligonucleotide or polynucleotide. 

25. An array of claim 20, wherein a linker joins at least one of said nucleotide molecules to 
15 said solid substrate. * 

26. An array of claim 20, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules having the same sequence, and each distinct physical location on the 
substrate contains nucleotide molecules having a sequence which differs from the sequence of 

2 o nucleotide molecules at another physical location on the substrate. 

27. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO:21 2-422, 
25 b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 

amino acid sequence selected from the group consisting of SEQ ID NO:21 2-422, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:212-422, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
30 ofSEQIDNO:212-422. 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:1040582 . 1 :2000FEB18 
<400> 1 

cggccgtcca gacgtcgtgg tggagtgaga ggttcccagg gaggaatgtg caaagtcctg 60 
ttggttctac ccctcttcta cttcgaggaa ggggtccgcg tcgcgcgttt ctggaacggg 120 
ctctctccct ggggtggctg ctgttctccg ccagattaag ccgggcctgc tggctcagta 180 
ctggagtgca gagctgaatt ctggccactt tgtcttctcc acagcctgtg ctcattgcca 240 
aggggacaat gacggcctcc agtgtcctcc tgcacactgg acagaagatg cctctgattg 300 
gtctggggac atggaagagt gagcctggtc aggtgaaagc agctattaaa tatgccctta 360 
gcgtaggcta ccgccacatt gactgtgctt ctgtatatgg caatgaaact gagattgggg 420 
aggccctgaa ggagagtgtg gggtcaggca aggcagtccc tcgagaggag ctgtttgtga 480 
catccaagct gtggaatact aagcaccacc ctgaggatgt agaacctgcc ctccggaaga 540 
cac 543 

<210> 2 

<211> 633 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 453570 . 1 :2000FEB18 
<400> 2 

cctgcacgct cctccgccfcg attcccagcg cgtcgcactt caagagattc gatcgggttc 60 
gccggttcgc gcccgccgcc atggcgactt cctcaggacc aaaggaggcg ccagcgaata 120 
accctgggct ccagaccgag gtcgaccccg ccaccaaagg ctacttcttg caacaaacga 180 
tgttacgtgt gaaggaccca aaagtgagcc ttgacttcta ctcacgtgtg atgggcatgt 240 
cgttgctgaa aaggttggat tttgaagaga tgaaattcag tttgtatttt ctaggttacg 300 
aggatgtgac cttagcccct gatgatcata tcaagcggac tgaatggact tttaggcaaa 360 
aagctactct tgagctcacc cacaactggg gcacggaaaa tgaccctgaa ttcaaaggtt 420 
accataatgg gaactcagac cctcgtggtt ttggtcatat aggagtaact gtggatgatg 480 
tccactaggc atgtgaacgc tttgaacgtc ttggggttga gtttgtgaag aaacctgatg 540 
atggaaaaat caatggcatc gctttcatca aagatcctga tggttatgga tcgaaatatt 600 
tgaccagacg attgggacgg ttacttcttc age 633 

<210> 3 

<211> 1674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 408751 . 3 :2000FEB18 
<400> 3 

tagggaccca ggatggcaga tccgggaccg ggctgggctg gcttggaaca tgcttgccaa 60 
ctcagccagc gtgaggatcc tcatcaaggg aggcaaggtg gtgaacgatg actgcaccca 120 
cgaggctgac gtctacatcg agaatggcat catccagcag gtgggccgcg agctcatgat 180 
ccctggcggg gccaaggtga ttgatgecac aggaaaactg gtgatccctg gtggcatcga' 240 
caccagcacc cacttccacc agaccttcat gaatgccacg tgcgtggacg acttctacca 300 
tgggaccaag gcagcactcg teggaggcac caccatgatc atcggccacg tcctgcccga 360 
caaggagacc tcccttgtgg aegcttatga gaagtgccga ggtctggccg accccaaggt 420 
ctgctgtgat tacgccctcc acgtggggat cacctggtgg gcacccaagg tgaaagcaga 480 
aatggagaca ctggtgaggg agaagggtgt caactcgttc cagatgttca tgacctacaa 540 
ggacctgtac atgettcgag acagtgagct gtaccaagtg ttgeacgett gcaaggacat 600 
tggggcaatc gcccgcgtcc atgctgaaaa tggggagctt gtggccgagg gtgctaagga 660 
ggcactggat ttggggatca caggcccaga aggaatcgag atcagccgtc cagaggagct 720 
ggaagctgaa gccactcatc gtgttatcac cattgeaaac aggactcact gtccaatcta 780 
cctggtcaac gtgtccagta teteggctgg tgacgttatc geagctgeta agatgeaagg 840 
gaaggttgtg ctggcggaga ccaccactgc acatgccacg ctgacaggct tacactacta 900 
ccaccaggac tggtcccacg cggctgccta tgtcacggtg cctcccctga gactggacac 960 
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caacacctca acctacctca tgagcctgct ggccaagtaa ggcgtttcag cagcacattg 
cagggatgtg tacatcttta ggaagacgtc atagagggcc caggaaacaa atctgagcta 
ggtttgattt cattgtgccc ataggatcat tgtgcttttc ttaccttctc tgagcccatc 
agataattgc catcctattg ggattttatg accgcttgat atggagtgag atcttctgaa 
gtataagagt gaagagcaca tatagaactg tgaaccgtgg ttattcggaa tttcaaagga 
cattgatctc ttactgctaa agcccagtgg ggagctgcag aacgtggcct gtgacttatc 
ctagaatgta ggggtccttt ctgtttcttt tgttcacctc tccctcttct tgttggctcc 
cgtgtgcccc ttgtctgtac atccctcctt ctgtgggttt ctcctccttt ccttcccctt 
tccctttttc ttctgcactt tctccccctt tccttcccac acccttcctc cagatctcct 
ttctccagct cttctatttc tgacactgct ttttccccat gcaatcccaa tctaaaggaa 
atggtttgat gaacggtctg aattcagtgg gacccaatta tggaagggtt tctgttttat 
cccaataact gcttgcagct tctttctctc tcctattaac atttttgggg ccag 

<210> 4 
<211> 1248 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 09 0574. 1 :2000FEB01 
<220> 

<221> unsure 
<222> 1148, 1234 
<223> a, t, c, g, or other 

<400> 4 

cgcgtacgta agctcggaat 
gctggggata ccgcgagcac 
atggtgatca gcaatctcca 
tccgaccact tagtatcaag 
attccttcaa tgttgacttt 
tcactggaag ctacaggtta 
gctccgagca catagtagat 
attcagacaa ataccccagc 
tgggagtgtt tttacagatt 
tggattccat taaagaaaag 
tgcttccacc atcctgggac 
ttgagagtgt cacatggatt 
ccaaatttcg cagtctcctg 
acttggccac cattccctgt 
ctgggtccca aactgaatcc 
gagaagtatt cattaacagt 
taggcataaa ataaggagaa 
atgccaaggc tgcttgaacg 
ttgttataag ttttccacaa 
aggtgggntg gacta^gatca 
acccattaat taaaagcttt 

<210> 5 
<211> 1451 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LI : 229932 .2 :2000FEB01 

<400> 5 

taatgattca aagataattt tataaaagcc aaagtaactt gtagctaata tataaaatac 60 
ttttgaccag gaacttttaa agtggatgct atgaagatag agagtgtttt tacacattgt 120 
ttattgattt taatttcact ttttctctcc ttagggtgtg attaggccca ctgggggata 180 
cgctgtcatg ctacactggt caatgtcttc catatacgga cttcaacccg gagaggtgtg 240 
gtgggcagct tctgacttag gctgggttgt tggacattcc tatatctgct atggacctct 3 00 
tcttcatggg aacacaacag ttttatatga ggggaagcct gtgggaacac cagatgctgg 3 60 
cgcttatttc cgtgtgcttg cagagcatgg agtagctgcc ttgtttacag caccaactgc 420 



3/228 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1674 



tcggctcgag ttccaccccg 
aacggtccta ttcactggaa 
attgagatta aaaccaaaga 
tatgacccaa gctcagctaa 
gatgacacag agaacaaatc 
cggcaggttc accttcactg 
ggagtgagct atgctgcaga 
tttgttgagg cagctcatga 
ggtgaaccta attcccaact 
ggtaaacaaa ctcgattcac 
tactggacat atcctggttc 
gttttaaagc aacctataaa 
tgcacagcgg agggtgaagc 
gggagcagct ctgccacctg 
actgggcctt gcttttgctg 
aaacattagt tgtgctcagc 
gtggcctcgg gtgaaagcag 
ttggattact cttcacctat 
gtggcataga gtctcttctg 
agggtcttca gcagttttga 
ttaatcaaaa gctntcttac 



agggaccatg tcgaggctca 60 
ggaatttttc cctattgctg 120 
agtgaaatat gactcttccc 180 
aa teat cage aacagcggcc 240 
agttctgcgt ggtggtcctc 300 
ggggtccget gatgaccacg 3 60 
gctccatgtt gttcactgga 420 
accagatgga ctggctgtct 480 
gcaaaagatt actgacactt 540 
aaattttgac ctattgtctc 600 
tcttacagtt ccacctcttc 660 
catcagctct caacagctgg 720 
agcagctttt ctggtgcatc 780 
ctctgggcat tgttaatcct 840 
ctgctaggga agagaaggca 900 
cctctgcagt ctacctctgc 960 
gtttggatta aaatttccta 1020 
acttctataa atgtgtaatc 1080 
tgaacactgt tcttctgtgc 1140 
teagecagtg gatttcaaag 1200 
aaagggcg 1248 
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aattagagca atccgtcaac aggaccctgg ggcagctttg gggaagcagt actctctgac 480 
aaggttcaaa acattatttg tggctggaga acgatgtgat gtagagaccc tggaatggtc 540 
caaaaatgtc ttcagagtac ctgtcttaga ccattggtgg caaactgaga ctggatctcc 600 
aattactgcg tcatgtgttg gattaggcaa ttctaaaaca cctccaccag ggcaagcagg 660 
aaaaagcgcc caggatacaa tgttatgatt ttggatgaca acatgcaaaa ctgaaggctc 720 
ggtgtttagg aaatattgtg gtaaaagtta ccattgccac ctggggcttt ttcaggactc 780 
tggaagaatc aggaagcatt caagcattta tactttgaaa aatttcctgg atattatgat 840 
accatggatg ctggttacat ggatgaagaa ggctatttgt atgttatgtc tcgagtggat 900 
gatgtaataa atgttgcagg tcacagaatt tctgcaggcg ccattgaaga gtcaatcctt 960 
tcccatggta ccgtggcaga ctgtgctgtt gttggcaagg aagatccctt aaaaggtcat 1020 
gtccccttag cactctgtgt attgagaaaa gatataaatg caacagagga gcaagttttg 1080 
gaagaaattg tgaaacacgt tagacagaac attggccctg tggctgcttt tcgaaatgca 1140 
gtgtttgtca aacagctacc caaaaccaga tctggcaaga tcccccgatc agctttatct 1200 
gccattgtca atggcaagcc atacaagata acttctacaa ttgaagaccc cagcattttt 1260 
ggccacgtag aagaaatgct gaagcaagca taatgagttt gtcttattcc tattttgagt 1320 
tgatttaatt tcttaattga aattaaatta tttgagfctgt ttctcagaaa taaatatttc 1380 
agtagaaatt acttgcaaaa tgaaatgtga attgtaaaac ttggcctaaa caaatctaat 1440 
gaaaacatgt g 1451 

<210> 6 

<211> 2615 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223>Incyte ID No: LI : 332176 . 1 :2000FEB01 
<400> 6 

actgataaaa cgatctaaag ggaaggagag accctttgtt cttacacgtt ctttctttgc 60 
tggatcacaa aagtatggtg ccgtgtggac aggcgacaac acagcagaat ggagcaactt 120 
gaaaatttct atcccaatgt tactcactct cagcattact gggatctctt tttgcggagc 180 
tgacataggc gggttcattg ggaatccaga gacagagctg ctagtgcgtt ggtaccaggc 240 
tggagcctac cagcccttct tccgtggcca tgccaccatg aacaccaagc gacgagagcc 300 
ctggctcttt ggggaggaac acacccgact catccgagaa gccatcagag agcgctatgg 360 
cctcctgcca tattggtatt ctctgttcta ccatgcacac gtggcttccc aacctgtcat 420 
gaggcctctg tgggtagagt tccctgatga actaaagact tttgatatgg aagatgaata 480 
catgctgggg agtgcattat gggttcatcc agtcacagaa ccaaaagcca ccacagttga 540 
tgtgtttctt ccaggatcaa atgaggtctg gtatgactat aagacatttg ctcatgggga 600 
aggagggtgt actgtaaaga tcccagtagc cttggacact attccagtgt ttcagcgagg 660 
tggaagtgtg ataccaataa agacaactgt aggaaaatcc acaggctgga tgactgaatc 720 
ctcctatgga ctccgggttg ctctaagcac taagggttct tcagtgggtg agttatatct 780 
tgatgatggc cattcattcc aatacctcca ccagaagcaa tttttgcaca ggaagttttc 840 
attctgttcc agtgttctga tcaatagttt tgctgaccag aggggtcatt atcccagcaa 900 
gtgtgtggtg gagaagatct tggtcttagg cttcaggaag gagccatctt ctgtgactac 960 
ccactcatct gatggtaaag atcagcctgt ggcttttacg tattgtgcca aaacatccat 1020 
cctgagcctg gagaagctct cactcaacat tgccactgac tgggaggtcc gcatcatatg 1080 
acaaagaact gcccctggtg atgtgagcag ggacctgcct gcccctttca acctttcccc 1140 
tcaccttttt tgagattttt gctgcaatct gtttgccttc cctgaatcaa aataatcttt 1200 
cattcgtcac cattatacta atgaacaata gatttcatgt ttcaaaattt cagattttac 1260 
atgttataga tgtactaaca atattccttg tatcaaacat ctccttttct ccctgataca 1320 
tagccctgag acatttatag cgttcaggag tcttctattg cttccattcc ttcagcaggg 1380 
ctgcgtgggt ctgttttaac gtgggccaag cctacctggg cagcccattt gccagggctt 1440 
gcctcaggcc atgcagcatt ggcgctctgg ctgcagcagc tgagttgctc aaggccagtg 1500 
tccaagtgga cagcagcctc tggtactccc cccatttatc ttccacccac atggagctgg 1560 
gcagagcagc cctcttctgt gtgcactgca tacgctgcag ccgtgggagt tattctcccc 1620 
tagagatcga cttggcagca cgaaggattc ttttctcttt catgcttctc aggctcaata 1680 
gtttctaatt aatcttaaaa tccatgtctt ttacattgtt tttttaatta agtgctgttt 1740 
actaaccaaa taatatttat aacatgagta agctataatt aataacaatg aaataaatac 1800 
ccatgtaccc accactggac ttcagaagta gaactcatga ctgggactag gatgaggcaa 1860 
gggagaccct ggccttgggc acaaaatgta agggatgcca aaaaaataca gtaatcaaag 1920 
taagtaatat ttcaatccaa tatttttaaa aatcagaatt aatgcaaaaa aaaccatgat 1980 
gaacaaaata ttaaaattta aaataaagac aggattagta ttactgagtt ttccttttgt 2040 
cccaggctct aatatggctt ggcatggggc agaacattac aacataccag tcgtgtcatg 2100 
gtgcccaagg ctccacagac ctcagtggct ccctgctgcc tgccacagca tctgttttag 2160 
cagcctcgac tcctcagcac tcctoagcac acacctcttc ttatcaggct tcctccactt 2220 
agcaacttgc taacggccac ctctgtgcct tctgatccct gggcgccaat atcctcctgc 2280 
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ccttaccatc cttccaggcc caacttaaat cccactttcc catgaagcct aactgcgtga 2340 
acacccctac ccccataccc attagcagtg attttgccct tccccgtaat gctgtcccac 2400 
ttataactgt gctctactta gcattctcag ggatcatacc ttaatgtttt cagtatgtct 2460 
gcgttctcct actagattgt atgtccctca agagcatgtt ctgtttctct tctgtctgac 2520 
agagcactat tatacctgac tttcagtaac tgttagctgt gattagttag ctggtggatt 2580 
taattgatta aaaaattacg attgaatgta aaaaa 2615 

<210> 7 

<211> 491 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 403248 . 2 :2000FEB01 
<400> 7 " 

gcctggcagg aggaggaaaa tggttccacc tccaagacca tcgtgtacca ccttgatgag 60 
ggcgtgtatg ggatcttcaa ctcagtcctg tttgacaaca tctgccctac ccccatcctg 120 
cagaagtcta agaaccactc accctgctac atgtctctag agtccattca tttcattgcc 180 
gtgtagcgct cttttgaaac catccacgga gcagcccctg tacagcagca gcctgtgggg 240 
cccggcggtt gatggctgtg attgcgtggc tgagggcctg tggctgccgc aactacacgt 300 
aggggactgg ctggtctttg acaacatggg cgcctacact gtgggcatgg gttccccctt 360 
ttgggggacc caggcctgcc acatcaccta tgccatgtcc cgggtggcct gggaagcgct 420 
gcgaaggcag ctgatggctg cagaacagga ggatgacgtg gagggtgtgt geaagcctct 480 
gtcctgcggc t 491 

<210> 8 

<211> 2309 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID ; No: LG: 220992 . 1:2000MAY19 
<220> 

<221> unsure 
<222> 1738 

<223> a, t, c, g, or other 
<400> 8 

aattaaagat ggctgcgccc atgtaacatc actagcgacc ggtgacctct ttttccccct 60 
tgcctggctc ctgtggtggc aggctgggca cgaggaccat gctgggccgg agcctccgag 120 
aagtttctgc ggcactgaaa caaggccaaa ttacaccaac agagctctgt caaaaatgtc 180 
tctctcttat caagaagacc aagtttctaa atgcctacat tactgtgtca gaagaggtgg 240 
ccttaaaaca agctgaagaa tcagaaaaga gatataagaa tggacagtca cttggggatt 300 
tagatggaat tcctattgca gtaaaagaca atttcagcac ttctggcatt gagacaacat 360 
gtgcatcaaa tatgctgaaa ggttatatac caccttataa tgctacagta gttcagaagt 420 
tgttggatca gggagctcta ctaatgggaa aaacaaattt agatgagttt gctatgggat 480 
ctgggagcac agatggtgta tttggaccag ttaaaaaccc ctggagttat tcaaaacaat 540 
atagagaaaa gaggaagcag aatccccaca gcgagaatga agattcagac tggctgataa 600 
ctggaggaag ctcaggtggg agtgcagctg ctgtatcggc gttcacatgc tacgcggctt 660 
taggatcaga tacaggagga tcgaccagaa atcctgctgc ccactgtggg cttgttggtt 720 
tcaaaccaag ctatggctta gtttcccgtc atggtctcat tcccctggtg aattcgatgg 780 
atgtgccagg aatcttaacc agatgtgtgg atgatgcagc aattgtgttg ggtgcactgg 840 
ccggacctga ccccagggac tctaccacag tacatgaacc tattaataaa ccattcatgc 900 
ttcccagttt ggcagatgtg agcaaactat gtataggaat tccaaaggaa tatcttgtac 960 
cggaattatc aagtgaagta cagtctcttt ggtccaaagc tgctgacctc tttgagtctg 1020 
agggggccaa agtaattgaa gtatcccttc ctcacaccag ttattcaatt gtctgctacc 1080 
atgtattgtg cacatcagaa gtggcatcga atatggcaag atttgatggg ctacaatatg 1140 
gtcacagatg tgacattgat gtgtccactg aagccatgta tgctgcaacc agacgagaag 1200 
ggtttaatga tgtggtgaga ggaagaattc tctcaggaaa ctttttctta ttaaaagaaa 1260 
actatgaaaa ttattttgtc aaagcacaga aagtgagacg cctcattgct aatgactttg 1320 
taaatgcttt taactctgga gtagatgtct tgctaactcc caccaccttg agtgaggcag 1380 
taccatactt ggagttcatc aaagaggaca acagaacccg aagtgcccag gatgatattt 1440 
ttacacaagc tgtaaatatg gcaggattgc cagcagtgag tatccctgtt gcactctcaa 1500 
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accaagggtt gccaatagga ctgcagttta ttggacgtgc gttttgtgac cagcagcttc 1560 

ttacagtagc caaatggttt gaaaaacaag tacagtttcc tgttattcaa cttcaagaac 1620 

tcatggatga ttgttcagca gtccttgaaa atgaaaagtt agcctctgtc tctctaaaac 1680 

agtaaacata tcttacaaat taaaatgact tttaggctgg gattacaggc gttagcancc 1740 

atgcccggct taaaactggc tttttaaatg gaacttttgt ttattttatt ttatttcttc 1800 

cttccctttt tttacaattt gaatggaact tttagctttg ccgtgctgcg ttaggaaatg 1860 

aagacacatc ctgaactgga acaagacttg ccaggggtga actggcaaac tttcctgtac 1920 

taacttcaca atccaaacca cacgatgtta tttcttatag tggctgtaaa ggaagaagct 1980 

gcagggctta aatgtactct tttccagttt ctggatttat ctgtagttta cttttatttt 2040 

gtcgctttgg tcgtgcattg aatgttctct tgactttcaa agatttgaaa atatatttaa 2100 

tatagtaatt cagagggggt gaactcttgc atggtttgat taaaattata gccttaaaaa 2160 

gcattgttag ttttagggca gatagccttt ggattcaagt atatcttgtc tttcatgtat 2220 

caattcaact gaccatttga ccttaaaagt tggtattttt aattccgaat tatttgtgtg 2280 

ctaccgtaaa tcaggcgtat tgcttgtca 2309 

<210> 9 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> * 
<221> misc_feature 

<223> incyte ID No: LG : 1094571 .1:2000MAY19 
<400> 9' 

cgcacgccgg cggcgcggag gccggctctg cgcttcgggc cgccccctcc ccccaccccg 60 
ctcacactcg gcacttactt cggctgtctc cgctgccctc cagcggagac gcagctcctc 120 
aggcgcccgg cggtatttgt tgggtcggcg gcgtcaggga ttcgcagtgg cctgtggtcg 180 
gcgtcgtccg gccactggtg cgcccccgcg gcaggcagag ctcacgctcc tgtcccccgg 240 
ctggtccggg gtctgggcgc cgcgtcgacg gcggctccgc aggacgcgca gaccgggccg 3 00 
cagcccatgc cccgagcgga ctgcattatg aggcacctgc cttacttctg ccggggtcaa 360 
gtggtgcggg gcttcggccg cggctccaag cagctgggca tccccacagc taattttcct 420 
gagcaagtgg tagataatct tccagctgat atatccactg gtatttacta tggttgggcc 480 
agtgttggaa gtggagatgt ccataagatg gtggtgagca taggatggaa cccatattac* 540 
aagaatacga agaagtctat ggaaacacat atcatgcata ccttcaaaga ggacttctat* 600 
ggggaaatcc tcaatgtggc cattgttggc tacctgagac cagaaaagaa ctttgattct' 660 
ttagagtcac ttatttcagc aattcaaggt gatattgaag aagctaagaa acgactagag 720 
ttaccagaac atttgaaaat caaagaagac aatttcttcc aggtttctaa aagcaaaata 780 
atgaatggcc actgatgaaa aattgtatta tttattcatt cactgttttc tagtgtttct 840 
gtgtttatta ctgttcagca tcatcttggt tatatactga aaatcaaaca ctttactaca 900 
gttgtgaatt agtttaaacc gtacaatgtt 930 

<210> 10 
<211> 2700 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No : LI : 350754 .4 :2000MAY01 
<400> 10 

gcactgtccg tgccattccc agaggagcct gagaagaggc agaggaaggg cgaaacatgg 60 
ctgctctagg agtccagagt atcaactggc agaaggcctt caaccgacaa gcgcatcaca 120 
cagacaagtt ctccagccag gagctcatct tgcggagagg ccaaaacttc caggtcttaa 180 
tgatcatgaa caaaggcctt ggctctaacg aaagactgga gttcattgac accacagggc 240 
cttacccctc agagtcggcc atgacgaagg ctgtgtttcc actctccaat ggcagtagtg 300 
gtggctggag tgcggtgctt caggccagca atggcaatac tctgactatc agcatctcca 3 60 
gtcctgccag cgcacccata ggacggtaca caatggccct ccagatcttc tcccagggcg 420 
gcatctcctc tgtgaaactt gggacgttca tactgctttt taacccctgg ctgaatgtgg 480 
atagcgtctt tatgggtaac catgctgaga gagaagagta tgttcaggaa gatgccggca 540 
tcatctttgt gggaagcaca aaccgaattg gcatgattgg ctggaacttt ggacagtttg 600 
aagaagacat tctcagcatc tgcctctcaa tcttggatag gagtctgaat ttccgccgtg 660 
acgctgctac tgatgtggcc agcagaaatg accccaaata cgttggccgg gtgctgagtg 720 
ccatgatcaa tagcaatgat gacaatggtg tgcttgctgg gaattggagc ggcacttaca 780 
ccggtggccg ggacccaagg agctgggacg gcagcgtgga gatcctcaaa aattggaaaa 840 
aatctggctt cagcccagtc cgatatggcc agtgctgggt ctttgctggg accctcaaca 900 
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cagcgctgcg gtctttgggg attccttccc gggtgatcac caacttcaac tcagctcatg 960 
acacagaccg aaatctcagt gtggatgtgt actacgaccc catgggaaac cccctggaca 1020 
agggtagtga tagcgtatgg aatttccatg tctggaatga aggctggttt gtgaggtctg 1080 
acctgggccc cccgtacggt ggatggcagg tgttggatgc taccccgcag gaaagaagcc 1140 
aaggggtgtt ccagtgcggc cccgcttcgg tcattggtgt tcgagagggt gatgtgcagc 1200 
tgaacttcga catgcccttt atcttcgcgg aggttaatgc cgaccgcatc acctggctgt 1260 
acgacaacac cactggcaaa -cagtggaaga attccgtgaa cagtcacacc attggcaggt 1320 
acatcagcac caaggcggtg ggcagcaatg ctcgcatgga cgtcacggac aagtacaagt 1380 
acccagaagg ctctgaccag gaaagacaag tgttccaaaa ggctttgggg aaacttaaac 1440 
ccaacacgcc atttgccgcg acgtcttcga tgggtttgga aacagaggaa caggagccca 1500 
gcatcatcgg gaagctgaag gtcgctggca tgctggcagt aggcaaagaa gtcaacctgg 1560 
tcctactgct caaaaacctg agcagggata cgaagacagt gacagtgaac atgacagcct 1620 
ggaccatcat ctacaacggc acgcttgtac atgaagtgtg gaaggactct gccacaatgt 1680 
ccctggaccc tgaggaagag gcagaacatc ccataaagat ctcgtacgct cagtatgaga 1740 
ggtacctgaa gtcagacaac atgatccgga tcacagcggt gtgcaaggtc ccagatgagt 1800 
ctgaggtggt ggtggagcgg gacatcatcc tggacaaccc caccttgacc ctggaggtgc 1860 
tgaacgaggc tcgtgtgcgg aagcctgtga acgtgcagat gctcttctcc aatccactgg 1920 
atgagccggt gagggactgc gtgctgatgg tggagggaag cggcctgctg ttgggtaacc 1980 
tgaagatcga cgtgccgacc ctagggccca aggagcggtc ccgggtccgt tttgatatcc 2040 
tgccctcccg gagtggcacc aagcaactgc tcgccgactt ctcctgcaac aagttccctg 2100 
caatcaaggc catgttgtcc atcgacgtag ccgaatgaag ggcgctggtg gcctcccgta 2160 
caaacttgga caacacggag cagggagagc tcaccatggg aatgaacccc ccgcccatgc 2220 
tgtccggcct gggtaaaccc tctccatctc ccaaggctgc cagacatgga cctccaggct 2280 
ccagcacatc cccctctcct ctcccccagg ttgggtctgg gtccaccctg tcctatgact 2340 
tgatcacttt tgcacattgc ctggccgctt ctccccagag ctgcctgctc tgtgagcccc 2400 
acagccctgc tcattcctca cgcccttcaa tgctgcagga tggactggct cctgacccag 2460 
ggactctcca aacgggatac aggagagaag ctggtctaga ctgtttgctg atccccaacc 2520 
tgcacggggc attcc.tgctt ctcatctcag gccaccacag agggcagggg atggttagtc 2580 
acctgcccca gcactcacac cctaactcaa aataaatgtt aaataagtgc gatcacacag 2640 
caccgactct acctgactgc tgttgcacct aaggtggtga aggagagagg aggctgtggc 2700 

<210> 11 

<211> 534 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 255828 . 29 : 2000MAY01 
<400> 11 

ctgtaccatt ggcccagctt cccgatcagt ggagacgttg aaggagatga ttaagtctgg 60 
aatgaatgtg gctcgtctga acttctctca tggaactcat gagtaccatg cggagaccat 120 
caagaatgtg cgcacagcca cggaaagctt tgcttctgac cccatcctct accggcccgt 180 
tgctgtggct ctagacacta aaggacctga gatccgaact gggctcatca agggcagcgg 240 
cactgcagag gtggagctga agaaggcagc cactctcaaa atcacgctgg ataacgccta 300 
catggaaaag tgtgacgaga acatcctgtg gctggactac aagaacatct gcaaggtggt 360 
ggaagtgagc cgactgcatc atgctgtctg gagaaacagc caaaggggac tatcctctgg 420 
aggctgtgcg catgcagcac ctgatagctc gtgaggctga ggcagccatg ttccaccgca 480 
agctgtttga agaacttgtg cgagccttaa gtcactccac agacctcatg gaag 534 

<210> 12 

<211> 2195 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1190263 . 1 :2000MAY01 
<400> 12 

ccacgctgtc gggacattaa tttggtgccc tacactgtgg ggcctcagaa aagactcaga 60 
atccttggac gcagttggcc gaacccggag cttagttaac ccgcagggcc atttggaagc i20 
cctcccgtgg ctttaggttt ctcctccatg tggaacctga acaacaagaa ggtattccaa 180 
gcttgcacaa tggagtcgcc caggagaccc ttggtctttt catgttcttt ggtgtgtggc 240 
acttctttga acactatggc cagtgcacac atgggtgcat tgacgacaag gcctaatatt 300 
gtcctaaatc catggttgat gacctggggt atgggagatc tgggctgcta cggcaatgac 360 



7/228 



WO 01/62927 



PCTYUS01/06059 



accatgcagg 
cacatctctg 
ataccccatc 
atcttgcagt 
gaagcaagga 
tcccagaagt 
taatgtgcat 
acaacggaat 
atcctcaccc 
cttctccatg 
ccctttatac 
ggctgaacga 
taaggaaact 
ggacgatttc 
ctccatggtg 
tgtctacttt 
tggtggatgg 
ccgcgtccca 
acctacaagt 
tcaggacagg 
actcggagca 
tccccaagga 
caccagcttc 
acctaccaaa 
ccttgacacg 
tgaaggaaca 
agggcaggag 
gaagaggaag 
cagaggaagc 
actaactttg 
ccattccaga 



acgcctcaca 
ccggctccct 
cggatcacgg 
ccccgcgagg 
tacagcacgg 
gagccatgtg 
gcctgttcac 
tagcctttga 
taacctttgg 
attctgttta 
tgggactgcc 
gctggatcca 
ttccttctct 
actggcacca 
ggcaagattc 
acatcagatc 
aatggaatat 
ggaattgtcc 
ttaatggata 
gtcattgacg 
tgaatttctt 
cgacagtggg 
tggtggctgc 
acccccttct 
tgccacagag 
cccaggaaac 
gtggttgaag 
tctctaagcc 
ctggggtcct 
gtgctttcaa 
ttattaaagg 



ttcgaccgcc 
ctgcagccca 
taataggtta 
cctccctctt 
ggcttatagg 
ccacccatcc 
ctctctgttg 
gagtcagctc 
gaagctgagc 
ttttcctctt 
tcctcatgcg 
ttatggtgaa 
ttttctcctt 
gcaagcatgg 
ttgatgctat 
acggagggca 
acaaaggtgg 
gatggcctgg 
ttttaccaac 
gccgagacct 
ttccactact 
tcagtttgga 
tatgtcacct 
tgcttcttcg 
cccctctatg 
catcgtgctt 
ctttggtggt 
tcggggtcct 
aacgagaaga 
gttggcaagg 
cccactggta 



ttgccaggga 
aagccggtcc 
ctagtgggta 
aatgagacaa 
caaatggcac 
atataattag 
acatgctgct 
tggctctgtg 
ggctgggtct 
gggctatgct 
ggggcacgag 
ggaggcgatt 
tcttcacgtg 
cttgtatggg 
cgatgatttt 
tttggaagct 
aaaaggcatg 
aaaggtacca 
tgtcgcatca 
catgcccttg 
gtggctccta 
aggctcagta 
cattatgcag 
atctctccag 
atttggtgat 
gtgaccatac 
ggggtgtccc 
aaggagaaga 
gataattaca 
agtgcattta 
gttcc 



aggcgtgcag 
cgggtcctgt 
atagaagtgt 
cacttgcagc 
caaaggtttg 
tggcgttttg 
ggccgggacc 
tgcagctagt 
ctgttccctg 
tggttctcca 
atcacggagc 
tcctttttag 
cacacacctc 
gataatgtgg 
ggcctaagga 
aggcgagggc 
gggggctggg 
gctggacggt 
gtgtcaggag 
cttgcagggc 
cctgcacgcc 
tgtgacccgg 
atgttccgga 
gaacccctca 
taaaaaggtg 
caactttcag 
cattgggtct 
gatatttaca 
atcaggctac 
atagtcaata 



actgactcag 
gagggagaag 
gtttccaaat 
cttgttaaaa 
cactggcgat 
actaactaac 
ccctctcgtt 
tgccattgcc 
ggctcctgat 
gccacacgtc 
agcccatgaa 
aaaggcacag 
tccccaccac 
acgagatgga 
acaacaccct 
atgcccaact 
aaggtggaat 
tgattaagga 
gaagtctccc 
aacgtcaggc 
gtgcggtgga 
gtattcaagc 
gaacagggtt 
gagtccacac 
gccaaegecc 
aaattgaatc 
gtgtgacagg 
atcaggctac 
caaaggaagc 
aattcatcta 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2195 



<210> 13 
<211> 435 
<212> DNA 

<213> Homo sapiens 

<220> * 
<221> misc_feature 
<223> Incyte ID No: 



LG:270916.2:2000FEB18 



<400> 13 

gaagttttcc 

aatgatctcc 

gtgcccccca 

aggcgtcctt 

aactgcattg 

tgctgagaca 

ccactgttga 

ttcttccaca 



tccaacggct 
agatgctctc 
cccagaatgc 
gtggcagtga 
tgggagtttt 
ggtgactagg 
cggacggggg 
aacag 



tatggccctc 
cgatgcacct 
ccttccagaa 
tttggggaac 
gaaactgtgg 
gtgcactgga 
aaaagccaga 



tacgtggctt 
gctcaccatc 
gtgcttgctg 
cactgaggca 
agtcctggtc 
agaggttagc 
accgaccgct 



ctcactacaa 
tcttctgcct 
ttatccaggt 
tcaggaatta 
tggaaccaag v 
gccactagac 
ctcttatttg 



gaactctccc 60 
tctgcctcct 120 
ataggagcag 180 
gtggcttaat 240 
gggctgggtc 300 
acccaaagct 360 
actcacagct 420 
435 



<210> 14 
<211> 1574 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 999414 . 3 :2000FEB18 



<400> 14 

tagattcaat tcatttggaa acacagctcc 
gaaatgatgt cagaaaaccg aaggatgatg 
aggtcttccg attgagaact ataaggtttg 
cttgtgtcgt ccttttacat attctacatc 
agtagagtgt atttcatggc ttcttaattt 
agagagaagg acactgtcat tgcattgtat 



catttttgag tgtgcatgta cttttatgaa 60 
ataaatacta aaagtaacag gcatgtgaaa 120 
atttcatttt cagataatgg ggtcctagct 180 
aatggaagtt gtagcaccgt gtcagaataa 240 
ctttcaatta gactgagatc tttttcttca 3 00 
ttttttctga aaagagtagg ccgtatttta 3 60 
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ctgagatcac ggagttgtta tatgtgacgt tttggtcttc taatattctt cagtggattt 420 
tctctaaagt agtatgaaca gaaagccttg tatagcaaaa agtaaatcac gtaataattc 480 
tgagattttt ggaattgtca caactgagaa acattgctgg cggtgtatgg tccgcaagtg 540 
tcaagatgtt ccttgtgaat tgctcgcatc cagcattaag ggctggtttt tatcttttat 600 
ttttccaatc ctctttcctt ctcaaggtgt ccaagacaca cggagccacg gaatctcaca 660 
ggtgtctgag aattcctcct cctgggactc tcagaggatc cagaactgca gccggccctc 720 
gctttgctgt ccctgtccct gtccatgtat ctggtcacgg tgctgaggaa cctgttcagc 780 
atcctggctg tcagctctga ctgccccctc cacaccccca tgtacttctt cctctccaac 840 
ctgtgctggc ctgacatcgg tttcacctcg gccatggttc ccaagatgat tgtggacacg 900 
cagtcgcata gcagagtcat ctctcatgcg ggctgcctga cacagatgtc tttcctgctc 960 
cttgttgcat gtatagaagg catgctcctg actgtgatgg cctatgactg ctttgtagcc 1020 
atctgtcgcc ctctgcacta cccagtcatc gtgaatcctc acctctgtgt cttcttcgtt 1080 
ttggtgtcct tttttcctta gcctgttgga ttcccagctg cacagttgga ttgtgttaca 1140 
attaaccatc atcaagaatg tggaaatctc taatttggtc tgtgacccat ctcaacttct 1200 
caaccttgcc tgttctgaca gcgtcatcga tagcatattc atatatttag atagtactat 1260 
atttggtttt cttcccatct cagggatcct tttgtcttac tataaaattg tcccctccat 1320 
tctaagaatt tcatcatcag atgggaagta taaagccttc tccacctgtc gctctcacct 1380 
ggcagttgtt tgcttatttt atggaacagg cattggcgtg tacctgactt cagctgtggc 1440 
accagccccc aggagtggtg tggtggtgtc agtgatgtac actgtggtca cccccatgct 1500 
gaaccctttc atctactgcc tgagaaaaca gggacattca aagcgcctgt ggaggtgcgc 1560 
agcagaacag tcga 1574 

<210> 15 
<211> 849 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:429446 . 1 :2000FEB18 
<400> 15 

caggtcagag aacaaaaggc ttgctgccat cttgggagtg gccgccacca tcttgggagc 60 
tctaagaaca aagacccacc cgtaacattt ggtggcaact gtacggggat tctccaaagt 120 
gtttgtctct tccagaatca aaactgtaaa actacaaatt gttcttcaaa tggagcccca 180 
gatgcagtct atgactaaga tctaccacag acccttggac cggcctgcta gcccatgctc 240 
cgatgttgat gacatcgaag gcgcccctcc caaggaaatc tcaactgcat gacccctact 3 00 
acgccccaat tccgcaggaa gcagttagag cagtcattgg ccaacctccc caacagcact 360 
tgagttttcc tgttgagagg gagaactgag agacaggact agctggattt cccaggccaa 420 
ctaagaatcc ctaagcctat ctgggaaggt gaccgtatcc acctttaaac atagggcttg 480 
caacttagct cacacccgac taatcagata gtaaggagag ctcactaaaa tgctaattag 540 
gcaaaaacag gaggtaaaga aatagccaat catctattgc ctgagagcac agcaggaggg 600 
acaatgatca ggatataaac ccaggcattc aagccagcaa tggctaccct ctttgggtcc 660 
cctccgtttg tatgggagct ctgttttcat gctattttaa agtcttgcaa ctgcacactc 720 
ttctgatcca tgtttgttac agcttgagct gagcttttgc tcaccatcca gcaatgctct 780 
ttgccaccat cacagacccg ccactgactt ccatccctct ggatccagca gggtgttcag 840 
ctgtgctca 849 

<210> 16 
<211> 443 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 057229 . 1 :2000FEB01 
<400> 16 

aaactatcct ggattgtata tatatgtata tattgtgtat atatatgtat atattatata 60 
cacatatata taaaataaca acccagtcta cctgaatttt cacattttca gaaaactgta 120 
ttgcttgaat ctaaaattgc aagacaattt attttatttt attttatttt acatattttt 180 
ttgagacaga gtcttgctct gtttcccagg ttggagtgcg gtggtgcggt cttggctcac 240 
tgcaacctct gccttctggg atcaagtgat tctcctgcct cagcctcctg agtagctggg 300 
attatagaca tgcgccaccg tgcccagctg atttttgtat ttttagtaga gatggggttt 360 
cgccatgtta gccaggctgg tctggaagca ttttaaaacc aaaaggtttg tatttaactt 420 
acttactaag attattgtat tec 443 
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<210> 17 

<211> 1393 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 3519 65 . 1 : 2000FEB01 
<220> 

<221> unsure 

<222> 120, 765 

<223> a, t, c, g, or other 

<400> 17 

gaagctgtaa aactactaat tgttcttcaa aaggagcccc agatgcagtc catgactaag 60 
atctaccaca gacccctgga ccggcctgct agcccatgca ccgatgttaa tgatatcgan 120 
agcactcccc caaggaaatt tcaactgcac aacccctact acaccccaat tcagcaggaa 180 
gcagttagag cggtcgtcag ccaacctccc cagcagcact tgggttttcc tgttgagagc 240 
ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc ctaagcctag 300 
ctgggaaggt gaccgcgtct acctttaaac acggggcttg caacttagct cacacccaac 3 60 
caatcaggta gtaaagagag ctcactaaaa tgctaattag gcaaaaacag gaggtagaga 420 
aatagccaat catctatcgc ctgagagcac agcaggaggg acaatgatcc ggatataaac 480 
ccaagcattc gagccagcaa cggctgcccc ctttgtgtcc cctccctttg tatgggagct 540 
ctgttttcac tctattaaat cfctgcaactg cac.tcttc.tg gtctgtgttt gttacggttt 600 
gagctgagct ttcgctcgcc gtccaccact gctgtttgcc gccatcggag acctgccgct 660 
gacttccatc cctccggatc tggcagggtg ttcatgtgct cctgatccag agaggcaccc 720 
attgccattc ctgattgggc taaaggcttg ccattgttcc tgcangacta agtgcccggg 780 
ttcatcctaa tcgagctgaa cactagtcgc tgggttccac gattctcttc cgtgacccac 840 
gacttctaat agagctataa cactcaccgc acggcccaag attccattcc ttcgaatccg 900 
tgaggccaag aaccccaggt cagagaacat gaggcttgcc accattttgg aagtggcctg 960 
ccgccatttt ggaagtggcc tgccaccatc ttgggagctc tgggagcaag gaccccctgg 1020 
taacagtagt agatacatag aatttttaaa taaacatact tatattaagg gtactcttag 1080 
ggtctatact aagaaatttt gcatgcttgt cattaaaagc tttgagtcca agtcctgtgt 1140 
gtgtgtatat gactttgact ctaagtcaag tgtaaatgta actgtataga tatgccatgt 1200 
atataatgta aataaaatat atacatatat ttataaataa ggcagtctga aaacttgctt 1260 
gctggtgaac tttaaaacag cctctttctc gtgatatacc cctccagttt atccagtgga 1320 
aaagaaaatt actgctggat taaaaacaag tgcttaggct aggtgtggta cctcatgcct 1380 
gtaatttcag cac 1393 

<210> 18 

<211> 884 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 068682 . 1 :2000FEB18 
<400> 18 

acagcggcca atgtctggca gtgggcacat gggggtgcgg gggtgtaggt gccaagcgcc 60 
atggcttaga cccgagattg gagtccggcc gcccccccgc agccaagccg cctcctgccc 120 
cccgtgcgcc ctaggcgcca ccatgtcggg agacaaactt ctgagcgaac tcggttataa 180 
gctgggccgc acaattggag agggcagcta ctccaaggtg aaggtggcca catccaagaa 240 
gtacaagggt accgtggcca tcaaggtggt ggaccggcgg cgagcgcccc cggacttcgt 300 
caacaagttc ctgccgcgag agctgtccat cctgcggggc gtgcgacacc cgcacatcgt 3 60 
gcacgtcttc gagttcatcg aggtgtgcaa cgggaaactg tacatcgtga tggaagcggc 420 
cgccaccgac ctgctgcaag ccgtgcagcg caacgggcgc atccccggag ttcaggcgcg 480 
cgacctcttt gcgcagatcg ccggcgccgt gcgctacctg cacgatcatc acctggtgca 540 
ccgcgacctc aagtgcgaaa acgtgctgct gagcccggac gagcgccgcg tcaagctcac 600 
cgacttcggc ttcggccgcc aggcccatgg ctacccagac ctgagcacca cctactgcgg 660 
ctcagccgcc tacgcgtcac ccgaggtgct cctgggcatc ccctacgacc ccaagaagta 720 
cgatgtgtgg agcatgggcg tcgtgctcta cgtcatggtc accgggtgca tgcccttcga 780 
cgactcggac atcgccggcc tgccccggcg ccagaaacgc ggcgtgctct accccgaagg 840 
cctcgagctg tccgagcgct gcaaggccct gatcgccgag ctgc 884 

<210> 19 
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<211> 1116 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG: 242665 . 1:2000FEB18 
<220> 

<221> unsure 

<222> 1087, 1090 

<223> a, t, c, g, or other 

<400> 19 

caccacctca tttcgcttta tttcacagac ttccccatct cattttttat gttctatgct 60 
aatttctcaa gaagaacagg cccggcacca cctctgcgca caaccccgcg ggcctggctc 120 
aggcgggagt gcggggccag caccatgagc gccccgggca gccccgacca ggcctatgac 180 
ttcctgctca agttcctgct ggtgggcgac agggacgtag gcaagagtga gatcctggag 240 
agcctgcagg atggtgcagc tgagtccccg tacagccatc tcggggggat cgactacaag 300 
acgaccacca tcctgctgga cggccagcgg gtgaagctga agctctggga tacgtcgggg 360 
cagggaagat tttgtaccat attccgctcc tactctcgtg gtgcacaagg agtgatcctg 420 
gtctacgaca ttgcaaaccg ctggtctttc gagggtatgg atcgatggat taagaagatt- 480 
gaggaacatg cccctggtgt ccctaaaatc ctggtgggga atcgcctaca tctggcattc 540 
aagaggcagg tgcccaggga gcaggcccag gcctacgccg agcgcctggg tgtgaccttc 600 
tttgaggtca gccctctg.tg caatttcaac atcatagagt ctttcacgga gctggccagg 660 
atagtgctgc tgcggcacag gatgaattgg ctcgggaggc cgagcaaggt actgagcttg 720 
caagacctct gctgccgcac catcgtgtcc tgcacacctg tgcatctggt ggacaagctc 780 
ccgctcccca gtaccttaag aagccacctc aagtccttct ccatggctaa gggcctgaat 840 
gccaggatga tgcgaggcct ctcctactcc ctcaccacca gctccactca caaaaggagc 900 
agcctctgca aagtgaagat cgtctgccca ccccagagcc cacccaaaaa ctgcaccaga 960 
aacagctgca aaatttctta aggaaggcac caaaaggaaa caagctggaa tcgctccagg 1020 
aaaaactctg gttacacctg gaagatggaa gtcgcattgt agattcaaga aattagtttt 1080 
cagtttntcn cgggaaccat gctgcttggg aatgtg 1116 

<210> 20 
<211> 585 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 241743 . 1:2000FEB18 
<400> 20 

ccggcccaga atgtagaaac ggttaggagg gacctgtgaa aaagcgggac accaagaaga 60 
gcactgttgc tgtttttagt ttttaaaata agaggaagcc cagtgtccca tctgatgccg 120 
aggctgaagg agtctcgctc ccacgagtcc ctgctcagcc ccagcagtgc ggtggaggcg 180 
ctggacctca gcatggagga agaggtggtc atcaagcccg tgcacagcag catccttggc 240 
caggactact gcttcgaggt gacgacgtca tcaggaagca agtgcttttc ctgccggtct 300 
gcagctgagc gggataagtg gatggagaac ctccggcgag cggtgcatcc caacaaggac 3 60 
aacagccggc gtgtggagca catcctgaag ctgtgggtga tcgaggccaa ggacctgcca 420 
gccaagaaga agtacctgtg cgagctgtgc ctggacgatg tgctctatgc ccgcaccacg 480 
ggcaagctca agacggacaa tgttttctgg ggcgagcact tcgagttcca caacttgccg 540 
cctctgcgca cggtcactgt ccacctgtac cgggagaccg acaag 585 

<210> 21 

<211> 2950 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 034212 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 52, 132 
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<400> 21 

tactatattg ctcagcattt ctaagtattc tctaagtgct ctttatttat gntttaaaat 60 
agctctctta cccagctgcg aagactagaa gaacttgatt taggaaacaa tgaaatatat 120 
aatttgccag antcaattgg agccctctta catctaaaag atctctggtt ggatggaaat 180 
caactgtcag aattacctca ggaaatagga aatctgaaga acctgctgtg tttagatgtc 240 
tctgaaaaca ggttggaaag acttcctgaa gaaatcagtg gcctgacttc attaacggat 300 
ttagtcattt cccagaactt attagaaacg attccggatg gcattggaaa actaaagaaa 360 
ctgtcaatct tgaaggtgga tcagaataga ctcacacagt tgcctgaagc agttggggaa 420 
tgtgaaagtc tcactgagtt agttcttaca gaaaatcagc tcctgaccct gcctaaaagc 480 
attggaaaac taaagaagtt gagcaacttg aatgcagaca gaaataaatt agtgtcctta 540 
ccaaaagaga tcggcgggtg ctgcagcctc actgtgttct gtgtacgtga caacagacta 600 
actcggatac ctgcagaggt gtcacaggca acagaacttc atgtcctgga tgtggcaggg 660 
aacaggttgc tgcatctacc tttatccctg actgccttga agttgaaggc tctgtggcta 720 
tctgacaacc agtcccagcc cctgcttaca ttccagacag acacagacta caccacagga 780 
gagaagattt taacctgtgt cttacttcct cagctgcctt ctgaacctac ttgtcaagag 840 
aatctgcctc gctgtggtgc actggagaac ttggtaaatg atgtctctga tgaagcctgg 900 
aacgagcgtg ctgtcaacag agtcagtgcg atccgatttg tggaggatga gaaagatgaa 960 
gaagacaatg agacgagaac acttctaagg cgagccactc cacacccagg ggagttaaag 1020 
cacatgaaaa agacagtgga gaatttacgg aatgacatga atgctgctaa aggactggac 1080 
tcaaacaaaa acgaggtcaa tcatgccatt gaccgagtga ccacttctgt gtagagtttc 1140 
acctccaagt tttacctcct gtgtcttcct ctgctgtcga gacgttcctg tctgcttccc 1200 
gggagcctca cgtgctcctt gtcctaacca gcccccgcgc gccatcttcc cgtggagtgt 1260 
ggggaagctg etg.hetccca ggaagtgcct tactcatccc gcaaccagtc agcgcaccag 1320 
tggtctcccg gtgtgatttt ttttttttta atttcagttg tttgtaataa gtagaataca 1380 
ctactgtaaa catacgacct ttgtttttgt cttatgttgg ggtaaaggaa agcaggaagg 1440 
ggaattttta tcctcctccc ttccgtaaag tgctgggata ttttgaatcc cccaagttcc 1500 
cttggaccta ctgatgagag atagttttat gtatggggaa aaatggatac tttttaaacc 1560 
ttttttggca gctcagatgg tgtaaatttt aaaattttgt ataggtattt cataacaaaa 1620 
atatgtattt cttttttgtt attttatctt gaaaacggta catattttag tatttgtgca 1680 
gaaaaacaag tcctaaagta tttgttttta tttgtaccat ccacttgtgc cttactgtat 1740 
cctgtgtcat gtccaatcag ttgtaaacaa tggcatcttt gaacagtgtg atgagaatag 1800 
gaatgtggtg ttttaaagca gtgttgcatt ttaatcagta atctacctgg tggatttgtt 1860 
tttaaccaaa aagatgaatt atcaatgatt tgtaattata tcggttgatt ttttttgaaa 1920 
agatgaacca aaggatttga ctgctaatat tttattcctt acactttttt tctgaataag 1980 
tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatgtg ccatttgtaa 2040 
aataaaatag agcagaaaaa cacaaaaaga gaacactggt tcagacattc agtgggcaag 2100 
taaattatgg actgcaaaat aatgattttt attcaagaaa gctttaaaag ttttatatcc 2160 
agatatacaa ccacaataaa gcaaaataac ctactatcaa aatagaaatg ttgctatctt 2220 
tataagtgca atttaatttg taaatagagt ttgaatcaaa gtatcacaaa atactgcttc 2280 
aagatttaat tttaaatctg ctaatttaag ggatattggg aaaagttttg gtgtgtttct 2340 
gttgatttct tttttgtatg ctgtgataaa agagaaatga aaagtgccag tcactgtgtg 2400 
gtgtctagga aaatcatata tatttttttc tccaagaaat aaattcatcc tggacattgg 2460 
ccatacagct ttttaaaatt attactttgt atgttcaagt gatagcaggt agccaaattc 2520 
tttgacagtg tgctctggtc tgttaaatat ctaaattacc cgtcagttgt gagtgacctc 2580 
ctgtgggact tgcattcaca tggggcagag cccagaattg cctttgactc tggctagtaa 2640 
ttttgggttg tggctatctg gccaattgga ctccttataa acccgtcttc aaccaaccaa 2700 
ataccatact tttttttact tcctatacat ttacttattg ggttcactta gctttttagt 2760 
ctgtgtgtat gtaaatatgt gtgtatttta aaaactattt atggagctga aagtacataa 2820 
cacataagtg agtttcggct ggtcagtttt ataccttgtt ccctaatctg tgcataccaa 2880 
ttttcttttc tctgttttaa gatttttgac atcctctatt ttgttttaag cttataaaga 2940 
taatgccact 2950 

<210> 22 

<211> 1973 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:344886 . 1 : 2000MAY19 
<400> 22 

gttctgccga gtggggcgcc gcggccgcta ttgtcccgcc ccctgctccg caagattcga 60 
gcctgagcgg cctgggcgtc tcgagaggtg agagagttgg cggcgaggtc tcggcggcta 120 
agcgagcgtc ggcgactgtc tctccgcgag aggaggcaag ttggggtcca ggctccaaag 180 
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ccggtggccg cgtaccgcgg tggagccgct gtctttgagg tctgaggaga gaacagacag 240 
agtctagctc tttcacccag gctggagtga agtggtgcaa tctcagctca gtgcaacctc 300 
cgccccctgg gttcaagtga ttctcctgcc tcagcctccc cagtggctgg gattacagtg 360 
atatcctgag agaagatggg aaagggctgc aaggttgtgg tttgtggatt gttatctgtg 420 
gggaaaactg caattttgga gcagctcctt tatggaaatc atactattgg aatggaagat 480 
tgcgaaacaa tggaagatgt atacatggct tcagtagaaa cagaccgagg agtaaaagaa 540 
cagttacatc tttatgacac cagaggtcta caggaaggcg tggagctgcc aaagcattat 600 
ttttcatttg ctgatggctt cgttcttgtg tacagtgtga ataaccttga atcctttcaa 660 
agagtggagc ttctgaagaa agaaatcgat aagttcaaag acaaaaaaga ggtagcaatt 720 
gtggtattag gaaacaaaat cgacctttct gagcagagac aagtggacgc tgaagtggca 780 
cagcagtggg caaaaagtga gaaagtaaga ctgtgggagg tgactgttac agatcggaaa 840 
actctgattg aaccattcac tttattagcc agtaaacttt ctcaacccca gagcaaatca 900 
agctttcctt tgcctgggag gaaaaacaaa gggaactcta attctgagaa ctaaaaatca 960 
gtaatttcca caattgtatg ttgaatagtg attgccttta agtgtctgtg aacatggagt 1020 
aatattacta tttaaaatag gccatttgta tctacctttg gtccttagga aaattcctaa 1080 
ggaagtcaat taatgcactt tagatgttaa aagtatttgg gctaaggtta ttattgcctg 1140 
atatgaaata atatattctt attctcattg tttgaaacct gtctttgaaa ttagcacctt 1200 
tgttatttat gttgtacttg tgaaaacagt aaaatagttt ggatagttat gcaaatgcac 1260 
ctatgtgtaa cttcccccca accccaagct gtttcggaag atatcataat cattctgtgt 1320 
aacattatgc aaacttctaa gcccaaacat gactttgttt ttaaaaagtt cattaatcta 1380 
atgtctagga ttataaaaca tttttttgtg tctaaattgg acccaaaaca ttgaacagtt 1440 
tggggtagta agctaaattt catcttgtgg agattttgct aaacagacta agacccatga 1500 
tttagctttg ctcaaattag aatgtttagc atgagttgag gtaccaggta gtgttaagta 1560 
ggttcatcac gctctaaggc cgttttttcc ttagccagac ccctgttgat agaccagata 1620 
cttgagggca aactgtttgc tcctcctctt gaaaatgatt aggcacttaa ggacagtaaa .1680 
gctgtatttt ctggaaggaa gactgtatct tctggaatag ttttctagaa aactagtcat 1740 
atacaataaa agtatcaaaa atattgggct ctaatttgat ctgacttaga tgtctgagtt 1800 
tgtgttgttt ctctaaagat tttggcaaga ctcaagcaat gtggctgact gtaactttat 1860 
taatttaaaa ggtaggaagt aagctactta gtggtttcac ctgtgaaata actattttga 1920 
ctgaaatgta aaataagcta ttcaacaaag aacatattaa aacatcaagt aat 1973 

<210> 23 

<211> 683 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID -No: LG:228930. 1 :2000MAY19 
<400> 23 

gccggagccc agatgagccc 
caagcccgtg tgcgtgtccg 
tgactcaact attgttactc 
ctggacagat caaaacaagg 
atgtgggaga cacgccaaag 
gaacatcggg cgcctggagc 
catctcccat ttgaatctcg 
ggttttcagt ttggcaacaa 
aaaagcctat actaaactta 
catatatcgc ttgttttcag 
tcttccatct tcaagggtgg 
aaaggctccc ccaaaacatt 

<210> 24 
<211> 1435 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG :33 8927 . 1:200 0MAY19 
<220> 

<221> unsure 

<222> 1409, 1432 

<223> a, t, c, g, or other 



agatggccgg ggctcaaccc 
acagcctcaa gaagggcacc 
caattatttt gaggactgac 
agacagagct actggatctc 
ctcccaagga ccccaaatta 
agcgcatgat cacagtggtg 
tggctttcca agaagaagtg 
acctgctggc ccaaaacatg 
agctgcaagt cactccagaa 
cagatcggaa gcgagttgaa 
aaaaagctaa tgaggctgcg 
ttc 



ggagtgcacg ccttgcaact 60 
aaattcgtca agtgggatga 120 
cctcagggat ttttctttta 180 
agccttgtca aagatgccag 240 
cgtgaacttt tggatgtggg 3 00 
tatgggcctg acctcgtgaa 360 
gccaaggaat ggacaaatga 420 
tccagggatg catttctgga 480 
gggcgtattc ctctcaaaaa 540 
actgctttag aggcttgtag 600 
aaaagtgagc agagctgtgg 660 

683 
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<400> 24 

gtgagccgac cttctgagcc 
atctgtccca ggccagcgtg 
ggtagctgag atccaccccg 
catctgtgtc caacctggtc 
gaggccagag gctagaggac 
gaccacggga gaagacgaat 
ggaggtacct gaactccctg 
agcctgtgac cctctcagga 
tgctggagac agagcaggcc 
aggagctgct gaagacagcc 
tcttctccaa catctcctcc 
agcggcgcct ggacgactgg 
ctggccaggt ggaggtgggg 
cttattgtgt ggccatgcct 
gaatgagctc tgagaggctg 
gtcctggcca agccaagtga 
gagggcttcc tacacgcctg 
tcagcctgct gtgtgatgac 
tggccactaa agcatttaac 
ggatgtcctt taagatcttt 
attgattgct taaaacattt 
catacctggc gcatgcatat 
ttacctggga tgaagaataa 
agtgccagtc atgatagtag 

<210> 25 
<211> 1508 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: DG: 898771 . 1 : 2000MAY19 

<400> 25 

ggtgacccag cgacctgtgt gtcccacagc ccgtccgtcc ctctgtcagt ctccctgtca 60 
ctggcatgtc ctgggtggtc ccagtggcac cttctaggcc cggcccctgg atgcggtgga 120 
ggctgcccgg ggtgggtcag catggggttg tccctgtgag cacagctccg ggagacccgt 180 
actcagaatc ctccagctct ctcgcctgag gcggaaatgt gacttgggaa ttgtcctcca 240 
tataaaggaa gcaaaatcag ggcctagggg ctagagtgca gtggtgtgat catagctcac 300 
tgcagcctcg acctcccagg ctcaagccat cttcctgcct cagcctcctg aacagctgga 360 
actgcagtgt ccccaactcc tgcccagcca gtccacgcgg tgccggctcc tccagttacc 420 
gctttgtgca gaacgtgacc tcggacctgc agctggcagc agagtttgca gcaaaggccg 480 
cgtcggagca gcaggcagac acgtctggag gagacagccc caaggatgag tcaaagccgc 540 
cgttctccta cgcgcagctg atcgtgcagg ccatctcctc cgcccaggac cggcagctga 600 
ccctgagcgg gatctacgcc cacatcacca agcattaccc ctactaccgg acggccgaca 660 
aaggctggca gctgaagccg agtccccagg gccggatcgc ctctgaaggc ccttagaaca 720 
tggaactcac aggcatattg cttcccttaa aacagaagag acgcggcgcg cccgccctgc 780 
agctgcccaa gcagtgtacc ttccggtttc ccagcacggc catcaagatc cagttcactt 840 
cgctctatca caaagaagag gccccagcct ccccgctgcg gccactgtac ccccagatct 900 
cccctctgaa gatccacatc ccggagccgg acctccggag catggtcagc cccgtcccct 960 
ccccgacggg caccatcagt gtccccaact cctgcccagc cagtccacgc ggtgccggct 1020 
cctccagtta ccgctttgtg cagaacgtga cctcggacct gcagctggca gcagagtttg 1080 
cagcaaaggc cgcgtcggag cagcaggcag acacgtctgg aggagacagc cccaaggtct 1140 
gagcccacct ggcgccgtgg tgcacctggt gacccaggat cgggccatag tgacccggcg 1200 
cgggcggcac agacagccca gggcctgcgg gcgtgtgctc gaggtggtct cagcccacag 1260 
ggaatggagc cgcagctggc ggtagagccg tcggcccttt ctcgttttcg tgcccaaatc 1320 
atccccgttg gtcatttgct ggtgacaggg gtggcgccgt caccgagggc ccctcccgcc 1380 
acgctccttg ttcatctgcc gctggcagcc cggcgtcaaa gccttcatac agcccttgag 1440 
caggatggcg ttctccttat aactttgtct tagttcattt ttgtccaata gctgttttca 1500 
aattgatt 1508 

<210> 26 
<211> 574 
<212> DNA 

<213> Homo sapiens 



ctcaagaaag atcagaacag 
gctgagtgtg ctggctggag 
gaaaccggca ggatgaaggg 
actgtgtttg agaatagcag 
gtgcatcacc gccctgagtg 
gtcggggagg ccgtggggtc 
aagaacaagc tgtccagcga 
tcggggacgc aggagccaga 
tatgtggcgc gcctccacct 
cgcagcagca aggccttccc 
atctatcagt tccattctca 
tgaggtccac caggagcccc 
agaccccagg gcgctaggtt 
ccctggggct cggtctgtct 
tgtgtgtgca gactgcttcc 
ccaagtgact ttgggacagt 
gccttgggct ttccctctat 
ggcatcatgg ttatgcaaga 
agtgaagtta cttgccttat 
ggcaaagaag aatgctagta 
ataataaact gttgtttttt 
tttttaagag tacccactgg 
agagagcaaa ctaccacaag 
taataatant aataaacatc 



attcatgggt gatttagcct 60 
gcctctctct ctgcttcgag 120 
ggcaagtgag gagaagctgg 180 
gaccccagaa gcagcaccca 240 
caggcctccc gagtccccag 300 
tgagcccagg acagtcagca 360 
agcctggagg aaatcttgcc 420 
gaagaagatc gtccaggagc 480 
gctagaccag gtgtttttcc 540 
agaggatgtg gtcagggtca 600 
gttcttcctc ccagagctgc 660 
tgaggcctca gaccacagcc 720 
cagcctccta ggcccaggcc 780 
gcacagtgaa gggcttgggt 840 
tggggaaagg ggtggatcag 900 
gaaaatagcc agtgccaact 960 
taactccctc aatcttcaca 1020 
aaccctcttg ttttcacaga 1080 
atagtttgcc acgaagtcta 1140 
attgttaaac ctagtgatgt 1200 
tttaagataa taatagccac 1260 
agtctaatat atagcctggg 1320 
caatggttga gcccctgtca 1380 
tattgggcca anaaa 1435 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 257664 . 67 :2000MAY01 
<400> 26 

cgacctaaga attgccggga ggatggatgg ctggcgcgcg aattgttgct tttgctcttg 60 

gctgatggtt gaggaatgta gagagactgt gtggggtttg tgtgacactg cggtgccttg 120 

cagattaggt gagctttcaa tctctcttta aaaggggcgg cgaagaggac ccgcaggctg 180 

caaggagcaa cagcgatggg taggcacatt caaaaccaga aaaaattgtt aacaaaccaa 240 

acaaacatga agatcatact gggaaactga ggccagagac ccgggaagaa aataaaaatc 300 

atttaaagga ccatcagcca tattggcata catttgtgaa taacacacaa tttcctgaca 360 

tttgggagca agtaaagtgc gtaacataaa tttttttggc catgcttaaa gtagaatgag 420 

atccagctgg gggaaagtta tagatttcag ccttggatga tcagtttagg atttcgagaa 480 

ttgatgtttt tccatatggt ttctgttttc ccccagctct ccatttgaac cagctgcttg 540 

ttttaaaact atctactaaa ttccatctac ccac 574 

<210> 27 

<2U> 1969 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 001496.2 :2000MAY01 
<400> 27 

ggggcgctgg gggtgacggt gcggagccgc tgccagcgct gggcgagagt cggcggccgg 60 
atccgaggag caggcgggcc tgaggccgag tcagctgcgc gggcccccgg atcccccgac 120 
agagacggcg gcggtgtctg gccaggcggt aggcgctgcc tggcccgcgg cggggaagat 180 
gttcagcgta gagtcgctgg agcgggcgga gctgtgcgag agcctcctca cttggatcca 240 
gacatttaat gttggatgca ccatgccaga acccgtggaa gatttaacga atggggttgt 300 
gatggcccag gttcttcaaa agatagatcc tgcatatttt gatgaaaatt ggctaaacag 360 
aatcaaaact gaagtaggag ataattggag gctaaagata agcaatttaa agaaaatttt 420 
aaaaggaatc ttggattata atcatgagat tttaggacag caaattaatg actttaccct 480 
tcctgatgtg aaccttattg gggagcattc tgatgcagca gagcttggaa ggatgcttca 540 
gctcatctta ggctgtgctg tgaactgtga acagaagcaa gagtacatcc aagccattat 600 
gatgatggag gaatctgttc aacatgttgt catgacagcc attcaagagc tgatgagtaa 660 
agaatctcct gtctctgctg gaaatgatgc ctatgttgac cttgatcgtc agctgaagaa 720 
aactacagag gaactaaatg aagctttgtc agcaaaggaa gaaattgctc aaagatgcca 780 
tgaactggat atgcaggttg cagcattgca ggaagagaaa agtagtttgt tggcagagaa 840 
tcaggtatta atggaaagac tcaatcaatc tgattctata gaagacccta acagtccagc 900 
aggaagaagg catttgcagc tccagactca attagaacag ctccaagaag aaacattcag 960 
actagaagca gccaaagatg attatcgaat acgttgtgaa gagttagaaa aaggagatct 1020 
ctgaacttcg gcaacagaat gatgaactga ccactttggc agatgaagct cagtctctga 1080 
aagatgagat cgacgtgctg agacattctt ctgataaagt atctaaacta gaaggtcaag 1140 
tagaatctta taaaaagaag ctagaagacc ttggtgattt aaggcggcag gttaaactct 1200 
tagaagagaa gaataccatg tatatgcaga atactgtcag tctagaggaa gagttaagaa 1260 
aggccaacgc agcgcgaagt caacttgaaa cctacaagag acaggtagta gaactacaaa 1320 
acagattatc cgaagaatca aagaaagcag ataaactaga ttttgaatat aagcggctaa 1380 
aagaaaaagt tgacagtctt caaaaagaaa aggacaggct gagaacagaa agggattctc 1440 
tgaaggaaac cattgaagag cttcgttgtg tacaagctca agaaggccag ctcacaacac 1500 
aagggttaat gcctcttgga agtcaggagt cttcagacag tctagctgca gagattgtta 1560 
cacctgaaat cagggagaaa cttattcgtc ttcagcatga gaataagatg ttaaagctta 1620 
accaagaagg ttcggacaat gaaaaaatag ccttattgca gagccttcta gatgatgcaa 1680 
atctacgcaa gaatgaactg gagacagaga ataggctggt gaatcaaaga cttctggaag 1740 
tacagtcaca agttgaagaa ttacaaaaat ctttacagga tcaaggctca aaagcagaag 1800 
atgctatttc agtccttcta aaaaagaagc ttgaagaaca tctagagaag ctgcatgagg 1860 
ccaataatga actacagaag aagagagcca ttattgaaga tctcgagcca agatttaaca 1920 
acagctcctt aaaaattgaa gaattaccaa gaagctttac gaaagaaag 1969 

<210> 28 

<211> 2476 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: LI : 1085273 .2 :2000M&Y01 
<400> 28 

cgggccccgc cgggcccatg gacggcgcgg ccgagcgggc gccctgagcg cggtgcgggt 60 
cctcggagca cccccgaggt gagcgcaagc aaggtccggc atcatgtgct aggttattcc 120 
cagtgcgagg ccacacttgg gccgtcggag cage ccc tec tcacttcagg ggtcaccctc 180 
cccaagaccc attgccccat gatggccggg gaccggctcc ccaggaaggt gatggacgee 240 
aagaagctgg ccagcctgct gcggggcggg cctggggggc cgctggtcat cgacagccgc 300 
tccttcgtgg agtacaacag ctggcatgtg ctcagctccg tcaacatctg ctgctccaag 360 
ctggtgaagt ggcggttgca gaagggcaag gtgaccattg tggagttcat ccagccggct 420 
gcacgcagcc aggtggaggc cactgagcca caggacgtgg tggtctatga ccagagcacg 480 
cgggacgcca gcgtgctggc cgcagacagc ttcctctcca tcctggctga gcaagctgga 540 
cggctgcttc gacagcgtgg ccatcctcac tgggggcttc gccaccttct cctcctgctt 600 
ccccggcctc tgegagggea agcctgctgc cctgctaccc atgagectet cccagccctg 660 
cctgcctgtg cccagcgtgg gcctgacccg catcctgcct cacctctacc tgggctcgea 720 
gaaggacgtc ctaaacaagg atctgatgac gcaaaatgga ataagctacg tcctcaacgc 780 
cagcaactcc tgccccaagc ctgacttcat actgegagag ccgcttcatg cgggtcccca 840 
tcaacgacaa ctactgtgaa aaactgctgc cctggctgga caagtccatc gagtacatcg 900 
ataaagecaa gctctccagc tgecaagtea tcgtccactg tctggctggc atctcccgct 960 
ctgccaccat cgccatcgcc tacatcatga agaccatggg catgtcctcc gacgacgcct 1020 
acaggttcgt gaaggacagg cgcccgtcca tctcgcccaa cttcaacttc ctgggccagc 1080 
tgctggagta cgagcgcacg ctgaagctgc tggccgccct geagggegae ccgggcaccc 1140 
cctcagggac gccggagcct ccgcccagtc ctgccgccgg ggccccgctg ccacggctgc 1200 
caccacctac ctcagagagc gctgccacag ggaatgegge tgecagggag ggcggcctga 1260 
gegegggegg ggagcccccc gcgcccccca cgcccccggc gaccagcgca ctgeagcagg 1-320 
gcctgcgcgg cctgcacctc tcctcggacc gectgeagga cactaaccgc ctcaagcgct 1380 
ccttctccct ggacatcaag tctgcctacg cccctagcag gcggcccgac ggccccgggc 1440 
cccccgaccc cggcgaggcc ccgaagctct gcaagctgga cagcccgtcg ggggccgcgc 1500 
tgggcctgtc ctcgcccagc ccggacagcc cggacgccgc gectgaggeg cggcccacgg 1560 
ccccgccggc ggccccggcc ccccgccggc tcccccgcgc gctcccccgc gcacagcctc 1620 
ggectgaact teggegatge ggcccggcag actccgcggc acggcctctc ggccctgtcg 1680 
gcgcccgggc tgcccggccc tggccagccg gccggccccg gggcctgggc accgccgctt 1740 
gactccccag gcacgccgtc gcccgacggg ccctggtgct tcagccccga gggegcac^g 1800 
ggggcgggcg gggtgctgtt tgcgcccttc ggcegggegg gcgccccggg accaggcggc 1860 
ggcagcgacc tgcggcggcg ggaggcagcg agggctgagc cccgggacgc gcggaccggc 1920 
tggeccgagg agccggcccc ggagacgcag ttcaagcgcc gcagctgcca gatggagttc 1980 
gaggagggca tggtggaggg gcgcgcgcgc ggegaggage tggccgccct gggcaagcag 2040 
gegagcttet egggcaaegt tgaagtcatc caagttgtct ctgacccttc ggctgccttc 2100 
ggccccgccg cccgcagcca ggcccgtaat aaatgtatat tatatataat gcaaagaaag 2160 
gtaaatggtt taactgggaa ttttaatccg agaagtaaaa taatttcgat tttttattta 2220 
tttaagctgt tcattctggc aatggatttg gcaacagtgc gggtggtcct cgagcttcca 2280 
tttttactgt ctggtattta aactgaaaca tacgtttcta agcaatacga ggccaccttc 2340 
cagtegcaag ctgggtgcca ggcctggggc ccctcccagt tcccccgccc caggaaacac 2400 
tgetgacett tgeaaagget gecgagcttt cgtgcacttt ttacataaca aaaaggtgaa 2460 
aaaaaggaaa aaaaaa 2476 

<210> 29 
<211> 2689 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 333138 .2 : 2000MAY01 
<400> 29 

aggagctgga gtgggagctc aagcaggatt cttcccgagt ccctgggtaa gacaaccctg 60 
gcttcttttc ttggctttag agggtctcct tgcttaatgg gaagcgtgca gcacctagtg 120 
agtggatttg aagagecact ttgtaacjcaa cttgggcatt tatttcagee ccagttccag 180 
tcttccctga ctcttttggc atcaaggcat cc tcagaagc ttcaactctg gaggcaatgg 240 
gtcgaaagga agaagatgac tgcagttcct ggaagaaaca gaccaccaac ateeggaaaa 300 
ccttcatttt tatggaagtg ctgggatcag gagctttctc agaagttttc ctggtgaagc 360 
aaagactgac tgggaagctc tttgctctga agtgcatcaa gaagtcacct gccttccggg 420 
acagcagcct ggagaatgag attgctgtgt tgaaaaagat caagcatgaa aacattgtga 480 
ccctggagga catctatgag agcaccaccc actactacct ggtcatgcag cttgtttctg 540 
gtggggagct etttgacegg atcctggagc ggggtgtcta cacagagaag gatgecagtc 600 
tggtgatcca gcaggtcttg tcggcagtga aatacctaca tgagaatggc atcgtccaca 660 
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gagacttaaa gcccgaaaac ctgctttacc ttacccctga agagaactct aagatcatga 720 
tcactgactt tggtctgtcc aagatggaac agaatggcat catgtccact gcctgtggga 780 
ccccaggcta cgtg^ctcca gaagtgctgg cccagaaacc ctacagcaag gctgtggatt 840 
gctggtccat cggcgtcatc acctacatat tgctctgtgg atacccccca ttctatgaag 900 
aaacggagtc taagcttttc gagaagatca aggagggcta ctatgagttt gagtctccat 960 
tctgggatga catttctgag tcagccaagg actttatttg ccacttgctt gagaaggatc 1020 
cgaacgaggg ggtaacactg tgagaagggc ttgagtcatc cctggattga cggtaacaca 1080 
gccctccacc gggacatcta cccatcagtc aggcgtccag atccagaaga actttgctaa 1140 
gagcaagtgg aggcaagcct tcaacgcagc agctgtggta gcacacaatg aggaagctac 1200 
acatgaacct gcacacgccc gggcgtccgc ccagaggtgg agaacaggcc gcctgaaact 1260 
caagcctcag aaacctctag acccagctcc cctgagatca ccatcaccga ggcacctgtc 1320 
ctggaccaca gtgtagcact ccctgcctct gacccaatat accctgccca gcatcgccgc 1380 
cggcccactg cccttggtgg caggtccctc aactgcctgg tcaatggctc cctccacatc 1440 
agcagcagcc tggtgcccat gcatcagggg tccctggccg ccgggccctg tggctgctgc 1500 
tccaagctgc ctgaacattg ggagcaaagg aaagtcctcc tactgctctg agcccacact 1560 
cctcaaaaag- gccaacaaaa aacagaactt caagtcggag gtcatggtac cggttaaagc 1620 
cagtggcatg cttccactgc cgggcagggc agactggagt ctgtctcatt atgtgattcc 1680 
tggagcctgt gcctatgtca cgtgcaattt tccaggagac atattcaact cctctgctcc 1740 
tccaaacctg gtgtctatcc ggcagaggga ggaaggcaga gcaagtggag cagggcttag 1800 
caggagcagt ttctgggcca gaagcaccat gcctgcttgc cagctggggc agccccgtca 1860 
taggaggccc aggagggagc cccaaggcgt agaagccttg gttgaagctg tgagcaggag 1920 
aagcggtgcc caccagcttc cagtgtctcc cgtgacctgg cctgctctat gccccacacc 1980 
ctacagtggc cgtggctctg tgcagtgtta cgttagatag ctctcgccct gggtctgtgc 2040 
tgtttgtcgt gaaaagctta aatgggctgg gccaggctgt gtcaccttct ccaaagcaaa 2100 
gcccatatgt gagcatctat cccagagatc ccaetctgtg cacactctct ctcacatctc 21-60 
taaagtctcc aacctcttgg ccagattgcg ctcattaatg gtcgttgcct gcccatctgc 2220 
catgaatacc agccagctcc ccatggtggt ctccctggtg agctcttcaa gttctaatcc 2280 
ttaactccag gattagctcc caagtgcgct gagaaccaag caagcacact ttcttggccc 2340 
ttctccctgc ctcaatctaa aagcagtgcc acaccctcca aaggtggaat agaaagaagt 2400 
tcatgagtaa gaggctgcaa cgttgaacta cttagtccat gagtccagca ctgtcctccg 2460 
tgcacagcag cccaaatgga cgttaacctt ggaagttgac ttatatttaa gtgtctgcca 2520 
ggaggttcta agtcctgcct ctggttcccg ttttctctcc ttgaaatgtg ccagtcgaca 2580 
tccatatctt ggtccgttcc ccaggtatgt cctcggcccg tcgcaccccg tccaggcttc 2640 
atgctcagag ttgtgcttaa taaaatggac atatttttct ctaaaaaaa 2689 

<210> 30 

<211> 1391 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 338927 . 1 :2000MAY01 
<400> 30 

gtgagccgac cttctgagcc ctcaagaaag 
atctgtccca ggccagcgtg gctgagtgtg 
acgggtagct gagatccacc ccgagaaacc 
ctggcatctg tgtccaacct ggtcactgtg 
cccagaggcc agaggctaga ggacgtgcat 
ccaggaccac gggagaagac gaatgtcggg 
agcaggaggt acctgaactc cctgaagaac 
ttgccagcct gtgaccctct caggatcggg 
ggagctgctt ggagacatga cgcaggccat 
tgtttttcca ggagctgctg aagacagccc 
tcagggtcat cttctccaac atctcctcca 
cagagctgca gcggcgcctg gacgactggt 
accacagccc tggccaggtg gaggttgggg 
cctaaggcgc caggccctta attgtgtggc 
cacagtgaag ggcttgggtg aatgagcttc 
ctggggaaag gggtggatca ggtcctggcc 
tgaaaataag ccagtgccaa ctgagggctt 
attaactccc tcaatcttca catcagcctg 
gcatagtacc ctcttgtttt ctacagatgg 
ttacttgcct tatatagttt gccacgttag 
agaagaatgc tagtaaattg ttaaacttag 
tacctggggc atgcatattt tttaagagta 



atcagaacag attcatgggt gatttagcct 60 
ctggcctgga ggcctctctc tctgcttcag 120 
ggcaggatga agggggcaag tgaggagaag 180 
tttgagaata gcaggacccc agaagcagca 240 
caccgacctg agtgcaggcc tcccgagtcc 3 00 
gaggccgtgg ggtctgagcc caggacagtc 360 
aagctgtcca gcgaagcctg gaggaaatcc 420 
gacgcaggag ccagagaaga agatcgtcca 480 
atgtggcgcg cctccacctg ctagaccagg 540 
gcagcagcaa ggccttccca gaggatgtgg 600 
tctatcagtt ccattctcag ttcttcctcc 660 
gaggtccacc aggagcccct gaggcctcag 720 
agaccccagg gcgctttaag gttcaagcct 780 
catgcctccc tggggctcgg tcactgtctg 840 
tgagaggcat gtgtgtgtgc agactgcttc 900 
aagccaagtg accaagtgac tttgggacag 960 
cctacacgcc tggccttggg ctttccctct 1020 
ctgtgtgtat gtacggctat catggttatg 1080 
gcccacttat agctattgta acagtgttag 1140 
ttttggatgt cctttataga tctttggcaa 1200 
tgatgtattg attacaggcg tgagccacca 1260 
ccccctggag tttaatatat agcctgggtt 1320 
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acctgggatg aagaataaag agagcaaact accccaagca atggttgagc ccctgtcaag 1380 
tgccagtcat g 1391 

<210> 31 

<211> 1886 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:335558 . 1 :2000FEB18 
<220> 

<221> unsure 
<222> 1844 

<223> a, t, c, g, or other 



<400> 31 

cattagcgct aggaagatgc gcacccggca atgttccctc gaaggggcag cggtagtggc 60 

agcgcctctg ctctcaatgc agcaggtacc ggcgtcggta gtaatgccac atcttccgag 120 

gattttccgc ctccgtcgct gcttcagccg ccgccccctg cagcatcttc tacgtcggga 180 

ccacagcctc cgcctccaca aagcctgaac ctcctttcgc aggctcagct gcaggcacag 240 

cctcttgcgc caggcggaac tcaaatgaaa aagaaaagtg gcttccagat aactagcgtt 300 

actcctgctc agatctccgc tagtatcagc tctaacaaca gtatagcaga ggacactgag 360 

agctatgatg atctggatga atctcacacg gaagatctct cttcttcgga gateetfegat 420 

gtgtcacttt ccagggctac tgacttaggg gagcccgaac gcagctcctc agaagagacc 480 

ctaaataact tccaggaagc cgagacacct ggggcagtct ctcccaacca gccccacctt 540 

cctcagcctc atttgcctca ccttccacaa cagaatgttg tgatcaatgg gaatgctcat 600 

ccacaccacc tccatcacca ccatcagatt catcatgggc accacctcca acatggtcac 660 

caccatccat ctcatgttgc tgtggccagt gcatccatta ctggtgggcc accctcaagc 720 

ccagtatcta gaaaactctc tacaactgga agctctgaca gtatcacacc agttgcacca 780 

acttctgctg tatcatccag tggttcacct gcatctgtaa tgactaatag gcgtgctcca 840 

agtactacag gtggaactag gtataaattc tgttactggc actagtacag taaataatgt 900 

taacattact gctgtgggta gttttaatcc taatgtgaca agcagcatgc ttggtaatgt 960 

taatataagt acaagcaata ttcctagtgc tgctggtgtg agtgttgggc ctggagttac 1020 

cagtggtgtt aatgtgaata tcttgagtgg catgggcaat ggtactattt cttcctctgc 1080 

tgctgttagc agtgttccta atgcagctgc agggatgact gggggatcgg tttcaagtca 1140 

gcagcaacaa ccaacagtta acacttcgag gttcagagtt gtgaagttag attctagttc 1200 

tgagcccttt aaaaaaggta gatggacttg cactgagttc tatgaaaaag aaaatgctgt 1260 

acctgctaca gaaggtgtgc tgataaataa agtggtggag actgtaaagc aaaatccgat 1320 

agaagtgact tctgaaaggg agagcactag tgggagttca gtgagcagta gtgtcagcac 1380 

actgagtcac tatacagaga gtgtgggaag tggagagatg ggagccccta ctgtggtggt 1440 

gcagcagcag cagcagcaac aacgactact acagcaacaa c cage tc tec aaggtgtgac 1500 

cc tccaacag atggattttg gtagcactgg tccacagagt attccagcag ttagtatacc 1560 

acagagtatt tctcagtcac agatctcaca agtacaatta cagtctcaag aactgagcta 1620 

tcagcaaaag caaggtcttc agccagtacc tctgcaagcc actatgagtg ctgcaactgg 1680 

tatccagcca tcgcctgtaa atgtggttgg tgtaacttca gctttaggtc agcagccttc 1740 

catttccagt ttggctcaac cccagctacc atattctcag gcggctcctc cagtgeaaac 1800 

tccccttcca ggggcaccac caccccaaca gttacagtat gganaacagc aaccaatggt 1860 

ttctacacag atggccccag gcatga 1886 

<210> 32 

<211> 1540 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 998283 . 7 :2000FEB18 
<400> 32 

ctttactgta tctgtaaaac gecttatgat gaatctaagt tctatattgg ctgtgatctt 60 

tgtactaact ggtatcatgg agaatgtgtt ggcatcacag aaaaggaggc taagaaaatg 120 

gatgtgtaca tctgtaatga ttgtaaacgg gcacaagagg gcagcagtga ggaattgtac 180 

tgtatctgea gaacacctta tgatgagtca caattttata ttggctgtga tcggtgtcag 240 

aattggtacc atgggcgctg cgttggcatc ttgcaaagtg aggcagagct cattgatgag 300 

tatgtctgtc cacagtgcca gtcaacagag gatgecatga cagtgctcac gccactaaca 360 
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gagaaggatt atgaggggtt gaagagggtg ctccgttcct tacaggccca taagatggcc 420 
tggcctttcc ttgaaccagt agaccctaat gatgcaccag attattatgg tgttattaag 480 
gaacctatgg accttgccac catggaagaa agagtacaaa gacgatatta tgaaaagctg 540 
acggaatttg tggcagatat gaccaaaatt tttgataact gtcgttacta caatccaagt 600 
gactccccat tttaccagtg tgcagaagtt ctcgaatcat tctttgtaca gaaattgaaa 660 
ggcttcaaag ctagcaggtc tcataacaac aaactgcagt ctacagcttc ttaaagttca 720 
gcgtgttaac ctaacataaa acacagcaag aatctggttg tctgaactat tttaaattaa 780 
ggagccacrat gtttttagtc aggctatcct gacaagactt gacctaaact tcgtttttat 840 
tggtcataac agtccaatta tattcttggc caattttgtc caacggacaa gaaaaaagca 900 
aagtcaacga caccattatc ttgtcaagat cagatggttt tactatttgt ggcagaagcg 960 
agaaaatttt gtttattaaa aaaaaaaaag aaaaagaaag caagaaacaa tgatactgtg 1020 
gggtcaagta taactccatg gaaatgccac gtctgctctt cagtgaagaa gctggtttag 1080 
agtctcaaag aaaacttttg actgtattta tttattgttg caaaaaagat gcttttttat 1140 
tgctgccctc atttgtcaac taattatttt ttcttataaa atccagccac ggttacatat 1200 
aatccatcca tatcttatca atgattcctg tacgtaaaag tacaagacaa cctctagatg 1260 
tcttttcttt ctatgaaagg agctgctatg tacacatgtg cacacacaca caactgcgaa 1320 
tcaacaatga gtttattgtt catggtagat taaaattaag cttgcataaa ggttgggcta 1380 
agtggtcctg gactacagac tctggtgact tgaatataac agtacaattt gtcaattact 1440 
ccacaccagg ttgaaatgag taaaatctat ttgaaggtat cttctttgta aacatttgtc 1500 
agattctaat ttttcttttt gtattaaaat tcaactatgg 1540 

<210> 33 

<211> 613 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI : 40273 9 . 1 :2000FEB01 
<400> 33 

ccgccgtggg ggcagagaag tccaacccct ccaagcgaca ccggcgggac atcaaggccg 60 
aggacgatga tcccgccggg ggagtgcacg tacgcgggcc ggaagcggag gaggcccctg 120 
cagaaacaga ggcccgccgt gggggcagag aagtccaacc cctccaagcg acaccgggac 180 
cgcctcaacg ccgagttgga ccacctggcc agcctgctgc cgttcccgcc tgacatcatc 240 
tccaagctgg acaagctttc tgtcctgcgc ctcagtgtca gttacctccg ggtgaagagc 300 
ttcttccaag gacaggggtt ggcagtggca gatgccgaag atgtggatga ccacacgggc 360 
gaacgaaggc ccatgtcgtt caggaggccc cgagcattgg acacccaggc cctgagacgt 420 
actcagttcg gtctgcatct gttgatggtg aatattgcag gactgatagc cactgaccgc 480 
ctgtagtgat gtgtgaggac aatgaaggca atggtgattc acaggcaggc cagcaccata 540 
tgggtgttag ttgtaaattc atggttacat aacataataa atttacctac gaattcatta 600 
tttataaaca aaa 613 

<210> 34 

<211> 507 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 175223 . 1 :2000FEB01 
<400> 34 

gcacattaga ccatgcgagc taaatttgtg atcgcacaaa atcaagatgt tagattgatg 60 
cagaagatca ctccgttcca aagggaaagt tttcatctca cgagtttgga gctgagggcc 120 
cgtggggcaa catggccgaa ggcggggcta gcaaaggtgg tggagaagag cccgggaagc 180 
tgccggagcc ggcagaggag gaatcccagg ttttgcgcgg aactggccac tgtaagtggt 240 
tcaatgtgcg catgggattt ggattcatct ccatgataaa ccgagaggga agccccttgg 300 
atattccagt cgatgtattt gtacaccaaa gcaaactatt catggaagga tttagaagcc 360 
taaaagaagg agaaccagtg gaattcacat ttaaaaaatc ttccaaaggc cttgagtcaa 420 
tacgggtaac aggacctggt gggagcccct gtttaggaag tgaaagaaga cccaaaggga 480 
agacactaca gaaaagaaaa ccaaagg 507 

<210> 35 

<211> 653 

<212> DNA 

<213> Homo sapiens 



19/228 



WO 01/62927 



PCTAJS01/06059 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 981076 .2 :2000MAY19 
<220> 

<221> unsure 
<222> 642 

<223> a, t, c, g, or other 
<400> 35 

gggcaacccg tctggaggca gcgcgcccgc gggcagcctc gttgtggctc cttgggctct 
gttggcggcg gcgctagctt cggagtctcc cgcgcgcacc tcagccgcct cctagcggcg 
cggcgctcgc tcctacgcct aaaatgacca atgtgtgatt tcagtggaat aaatggcgtc 
caaagtcaca gatgctatag tctggtatca aaagaagatt ggagcatatg atcaacaaat 
atgggaaaaa tctgttgaac agagagaaat caaggggcta aggaataaac caaagaaaac 
agcacatgtg aaaccagacc tcatagatgt tgatcttgta agagggtctg catttgcaaa 
ggcaaagcct gaaagtcctt ggacttctct gaccagaaag ggaattgttc gagttgtatt 
tttccccttt ttcttccggt ggtggttaca agtaacatca aaggtcatct ttttctggct 
tcttgtcctt tatcttcttc aagttgctgt aatagtatta ttctgctcca cttctagccc 
acacagcata cctctgacag aggtgattgg gccgatatgg ctgatgctgc tcctgggaac 
tgtgcattgc cagattgttt ccacaagaac acccaaacct cntctaagta cag 

<210> 36 
<211> 589 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1008973 . 1: 2000MAY01 
<400> 36 

cacatcccga ggaacagaag 
ccaagatggc ccgctgctcc 
ggtcgccgcc tggcgaccgg 
cccgagctcg gacagagccg 
tggcggcagg gtccactacg 
agacggagcc gaaaaagcta 
cagacatcct ggcggagact 
acgtgggcga ctttgccaga 
gaaacaccgg gcctgacccg 
aggctcttca ggagcgggaa 

<210> 37 
<211> 1315 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LI : 1190250 . 1 : 2000MAY01 

<400> 37 

tagggaatgg ctggggttgg ctgaagagcg cacagtggag ttttaatgtc cgccatgttg 60 
agccatggcg tattgaagag agcgagcgag agaggagcgg agcggcacag cctcccaccc 120 
tcccggctgg tgttagtgcc cggacggcgg gctctgcgct ccgcccctca agtccccggc 180 
agcggttggc gagtggggac cgaacccccg gttctccatg atcccgctgg ccggggccgt 240 
ttcccccaga gcggagaggt atctgctgcg cctgagatga gtaaactgtc gtttcgggcg 300 
cgggcgctag acgcctcgaa gccgctgccg gttttccgct gtgaggatct gcccgacctg 360 
cacgaatacg cctcgataaa cagggccgtg ccgcagatgc ccaccggaat agagaaggaa 420 
gaggagtcgg aacatcatct tccagcgggc t tat tt cage acagcaggtg tatggegaga 480 
agagggattt atatggttat accggtccca gaggcagaaa gtaattattg cttactatga 540 
agtctataat atcctgggga atttaagatg ccaaaagcag cttcatttca catacagcct 600 
tttagtttgg atgetgaaca gectgattat gatttggatt tctgaagatg aagatatttg 660 
tgaatataac tgttaaaaga aaaatggaac acttctgccc ccttgcccat ttgtaggcaa 720 
atggattgac ccccctagaa aaaggcagtg gtcagcagcc agtcagtctg caggaageca 780 
aactactget aaaagaagat gatgaactaa ttagagaagt ttatgaatat tggattaaaa 840 



60 
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cgggcctgct cagcatgtct 
gaggacccag ctcgctccag 
geagegagga ccggccaccc 
cgaccgcgag gaeagegacg 
ctgcgcgcag tggggaagct 
gagaaatatt tgcagaaact 
ggactcagaa agacggtgaa 
gaettagegg cccggtggaa 
caggaccctg aggagagege 
aagggctggg ggcttcccag 



gcaaggaccg gcgttccaga 60 
gctgtgactc cacatcccga 120 
cagactgcct gtgccgccgc 180 
cctgcacagc tetggegaga 240 
gcaggtgcgt ctggccacta 300 
ctccgccttg cccatgaccg 360 
gcgcctgcgg aagcaccagc 420 
gaagctggtg ctcgtggacc 480 
ttcccgacag cgctacgggg 540 
aaaacgega 589 
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agagaaaaaa ctgtcgaggg ccatctctta ttccatcagt aaaacaagag aagcgagatg 900 
gttccagcac aaatgatcct tatgtggctt ttagaaggcg tactgaaaaa atgcagactc 960 
gaaaaaatcg caaaaatgat gaagcctctt acgtaaatat gcttaagctg cgacgagatc 1020 
taagtcgagc tgttactatt ctagagatga taaaaagaag agataaaagt aaaaggagag 1080 
ctattgcact taacactgga aattatggaa aagaggtata atttgggcga ctacaatgga 1140 
gagatcatgt ctgaggttat ggcacagaga cagccaatga aacctactta tgccatcccc 1200 
atcatcccta ttactatata gcagtctatt taaacaccat ggaagcatat ggatgtgaag 1260 
gagttcaaat gttaataagc aagattatag tccgatctta tccgaccgat acggt 1315 

<210> 38 
<211> 1616 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 021371.3 :2000FEB18 
<400> 38 

gtgacagagc cgtgcctgga acccaggtct cctgactggg aggtcccagc tctcgccgtt 60 
gctctaatcc aggccaggtc ctactgtgct gacagaatgt cggctgttgg gcggtagtat 120 
ccatgctgcc ctctaaccac aggctgtctg tgtcatgtga ctcaatgcac ccatcagtgg 180 
tctgtgatga gtgggtggga ggatattttg gctggcctct ggcccacttg tccacgtctg 240 
tcctggtgtg tcagtggctg gcttgtgacc tggcatgtcc gtggcaggtg tggaaggaga 300 
gcctctagtg agttcccaga gtggacagag ccctccagarr ccacaggatc ccgaggctcc 360 
cagctcctca ggcccaggac acctggtggc catgggcaag gtgagcagga cccctgtgga 420 
agctggtgtg agccagtcag atgcagagaa cgcagccccc tcctgcccgg atgagcatga 480 
cactctgccc cggcgccgag gtcgaccttc caggcgcttc ctaggcaaga aataccgcaa 540 
gtactattac aagtcgccca aaccactttt gaggcccttc ctgtgccgca tctgtggttc 600 
tcgctttctg tcccacgagg acctgcgctt ccacgtcaac tcccatgagg ctggcgatcc 660 
ccagctcttc aagtgcctgc agtgcagcta tcgttcccgc cgctggtcct cgctcaagga 720 
gcacatgttc aaccacgtgg gcagcaagcc ctacaagtgt gacgagtgca gctacaccag 780 
tgtctaccgg aaggacgtca ttcggcacgc cgctgtgcac agccgggacc ggaagaagag 840 
gccagatccg actccaaagc tgagctcttt cccctgccct gtgtgtggcc gtgtgtaccc 900 
catgcagaaa agactcacgc agcacatgaa gacgcacagc actgagaagc cccacatgtg 960 
tgacaagtgt ggaaagtcct ttaagaagcg ctacaccttc aaaatgcacc tgctcacgca 1020 
catccaggct gttgccaacc gcaggttcaa gtgtgagttc tgtgagtttg tttgtgaaga 1080 
caagaaggca ctgctgaacc accagttgtc ccacgtcagt gacaagccct tcaaatgcag 1140 
cttttgtccc taccgcacct tccgagagga cttcttgctg tcccatgtgg ctgtcaagca 1200 
cacaggggcc aagcccttcg cctgtgagta ctgccacttc agcacacggc acaagaagaa 1260 
cctgcgcctg cacgtacggt gccgacacgc aagcagcttc gaggaatggg ggaggcgcca 1320 
ccctgaggag cccccctccc gccgtcgccc cttcttctct ctgcagcaga ttgaggagct 1380 
gaagcagcag cacagtgcgg cccctggacc acctcccagt tccccaggac ctcctgagat 1440 
acccccagag gcgacaactt tccagtcatc tgaggctccc tcattgctct gttctgacac 1500 
cctgggcggc gccaccatca tctaccagca aggagctgag gagtcgacag cgatggccac 1560 
gcagacagcc ttggatcttc tgctgaacat gagtgctcag cgggaactgg ggggca 1616 

<210> 39 

<211> 521 

<212> DNA -v 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG:475404 . 1:2000FEB18 
<400> 39 

ggcggttgct ttgtctctcc ttcgccggag ccggctcttc ctgttagtct ccgctgctag 60 

ttcttggctc tgggaggccc aggtggctct gcagcagcct ctgccaccct gtgacctgcg 120 

agtattggga catccctagc tgacgccagg acacccggga agccgaggaa tggactcggt 180 

ggcttttgag gatgtggctg tgaactttac ccaggaggaa tgggctttgc tagattcttc 240 

tcagaagaat ctctacagag aagtgatgca ggaaacctgc aggaacctgg cttctgtagg 300 

aagccaatgg aaagaccaga atattgaaga tcacttcgaa aaacctggga aagatataag 360 

aaatcatatc gtacagagac tgtgtgaaag taaagaagat ggtcagtatg gagaagttgt 420 

cagccaaatt ccaaatcttg atctgaacga gaacatttct actggattaa aaccatgtga 480 
atgcagtatt tgtggaaaag tctttgtacg tcattccctc c 521 
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<210> 40 
<211> 848 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 979406 . 2 :2000FEB18 
<220> 

<221> unsure 

<222> 407, 509, 590, 645, 689, 707, 829 
<223> a, t, c, g, or other 

<400> 40 

aattcactgt ctgtagcatc 
ccgatgcctg gacctcagga 
atagagtgga gttgcctggc 
aactatggga acctggcctc 
ttggagcaag gagaggagcc 
aaagccagcc tgtgcacaga 
cctgccagct cagaaggtgg 
ggtgctgcat cctcttggcc 
ggtgaggggg gattgagtta 
acatgcagac ctggGeccfec 
tgccaggggc tgcagaaaga 
agggctggat ccaccctcca 
gaagctgaag ggttgtccct 
caggaaagca cgaaggtcag 
cacctggg 

<210> 41 
<211> 1307 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LG:410726 . 1 :2000FEB18 

<400> 41 

tgctgttgcg gatgctgatg cgtttctgca ggactgtaaa ctggattcct ggaacctttg 60 

atattcctgg ctgtgtatag tgcctgttgg tggactgtac tgatactcaa ctagagtgtg 120 

aagggactgg attcctgccc ctgagacaca atgcaagctg tagtgccctt gaacaagatg 180 

acagccatct caccagaacc tcaaactctg gcctcgactg aacaaaatga ggtcccaaga 240 

gtggttactt ctggggaaca agaagctatt ttaagaggaa atgctgctga tgcagagtct 3 00 

ttcagacaga ggtttaggtg gttttgttac tcagaagtag ctggacccag gaaagctctg 360 

agtcaactct gggagctctg caatcagtgg ctgagaccag acattcacac gaaagaacag 420 

attttagagc ttctggtgtt tgagcagttc ctgaccattt tgcctgggga gatcaggatt 480 

tgggtaaagt cacaacatcc tgagagtagt gaggaagtgg tgaccctaat agaagatttg 540 

acccagatgc ttgaagaaaa agatccagtc tctcaagatt ctactgtttc ccaagaggag 600 

aactcaaaag aggataaaat ggtcactgtt tgtcccaata ctgagtcctg tgaatctata 660 

acattgaagg atgtagctgt gaacttttca agaggagagt ggaagaagct ggagcctttt 720 

caaaaggagc tatataagga agtgctactg gaaaacctca ggaacctaga atttctggac 780 

tttccagttt caaaattaga gttgatttcc cagctaaagt gggttgaatt gccatggctg 840 

ctggaagaag tctcaaaaag ctcccgacta gatgaatcag ctttagataa aataatagaa 900 

aggtgcctca gggatgatga tcatggcttg atggaagaat cccagcaata ttgtggcagc 960 

tcagaggagg atcacggtaa tcagggaaat tcaaaaggaa gagtcgcaca atacaaaact 1020 

cttgggagtg gcagtagggg taagaaattt gacccagatt aaagcccctt tggacatgat 1080 

ttcaaagaaa cttcagacgt aattaaacat ctgagagtct acttgaggaa gaaatctcgg 1140 

aggtataatg aaagcaagaa acccttcagt tttcattcag accttgttct gaccagcaag 1200 

gagaaaaccg ccggagataa gtcacggaaa tctaatgatg gtgggaaagt cctgagtcac 1260 

tcttcagctc gtactgaaca tcagaaacgt cagaagattc atttttg 1307 

<210> 42 
<211> 2456 
<212> DNA 



tgctcctcca cagagggacc 
ggcggttgtg ttcgaggatg 
ccccgaccag caggcactct 
actaggcttt cttgttgcca 
gggggccttg attctgcagg 
ggaccctaat acactgccca 
gccaggagag aagggtgtgg 
ccacggggag caccctgtca 
cctgcccant ggtgtgtcaa 
caaggcccac tgctccccgg 
gccatgggaa gcctggccct 
gagaacaana atcctgggca 
gagcatcccc aacatggagc 
ggccagcctg gggaaccaag 



ctggaatggc gatggcactc 60 
tggctgtgta cttcacaagg 120 
acagggacgt gatgctggag 180 
aaccagcact gatctcccta 240 
tggctgaaca gagcgtggcc 300 
gcagaagcca ggaaggaagc 360 
ccggaanggt agccggtggg 420 
cccctaacag atgaagagaa 480 
actccatgat gggggtggga 540 
gctgtgcgcn ccacagtetg 600 
ggtanggcac caagatgggg 660 
gtagccngga catggtgggg 720 
tcggaagtct gggccatggg 780 
agctgtgcna cgatcccttc 840 

848 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG-.200005 . 1 :2000FEB18 
<400> 42 

ggagcctggt caacaagcca gcaccactgt gctaggaggc caggccggct ctgcagccga 60 
cgcatcccgt gtgagcagac acacatagct cgctttcttg ggggctcttt gtcagccacc 120 
aacacagaga gcacacactg acacacacgt acacaaagat ctgagccagg ttgtgggagg 180 
gtatgttctg tcttgtgctt tttcttttag aagctactaa agggtgttgg ggatgcttct 240 
gactattatg aaggccaaaa ggcctgttga ctggggctgc ttttaaccct ttcctatttg 300 
ctgagaatgc agccgtgtga cagtaactga acattggtct aaagtctttc caaaaggtca 360 
aggttcacaa gaacatctgc fccaaattaat gaccatgggg gatatgaaga ccccagactt 420 
tgatgacctc ctggcagcat ttgacatccc agatatggtc gatcctaaag cagctattga 480 
gtctggacac gatga'ccatg aaagccacat gaagcagaat gctcacggag aggatgactc 540 
ccacgcacca tcatcttctg atgtgggtgt cagcgttatc gtcaagaatg ttcggaacat 600 
tgactcttcc gagggcgggg agaaagacgg ccacaacccc actggcaatg gcttacataa 660 
tgggtttctc acagcatcct cccttgacag ttacagtaaa gatggagcaa agtccttgaa 720 
aggagatgtg cctgcctctg aggtgacact gaaagactcg acattcagcc agtttagccc 780 
gatctccagt gctgaagagt ttgatgacga cgagaagatt gaggtggatg acccccctga 840 
caaggaggac atgcgatcaa gcttcaggtc gaatgtgttg acggggtcgg ctccccagca 900 
ggactacgat aagctgaagg cactcggagg ggaaaactcc agcaaaactg gactctctac 960 
gtcaggcaat gtggagaaaa acaaagctgt taagagagaa acagaagcca gttctataaa 1020 
cctgagtgtt tatgaacctt ttaaagtcag aaaagcagag gataaattga aggaaagetc 1080 
tgacaaggtg ctggaaaaca gagtcctaga tgggaagctg agctccgaga agaatgacac 1140 
cagcctcccc agcgttgcgc catcaaagac aaagtcgtcc tccaagctct cgtcctgcat 1200 
cgctgccatc gcggctctca gcgctaaaaa ggcggcttca gactcctgca aagaaccagt 1260 
ggccaattcg agggaatcct ccccgttacc aaaagaagta aatgacagtc cgagagccgc 1320 
tgacaagtct cctgaatccc agaatctcat cgacgggacc aaaaaaccat ccctgaagca 1380 
accggatagt cccagaagca tctcaagtga gaacagcagc aaaggatccc cgtcctctcc 1440 
cgcagggtcc acaccagcaa tccccaaagt ccgcataaaa accattaaga catcttctgg 1500 
ggaaatcaag agaacagtga ccagggtatt gccagaagtg gatcttgact ctggaaagaa 1560 
accttccgag cagacagcgt ccgtgatggc ctctgtgaca tcccttctgt cgtctccagc 1620 
atcagccgcc gtcctttcct ctccccccag ggcgcctctc cagtctgcgg tcgtgaccaa 1680 
tgcagtttcc cctgcagagc tcacccccaa acaggtcaca atcaagcctg tggctactgc 1740 
tttcctccca gtgtctgctg tgaagacggc aggatcccaa gtcattaatt tgaagctcgc 1800 
taacaacacc acggtgaaag ccacggtcat atctgctgcc tctgtccaga gtgccagcag 1860 
cgccatcatt aaagctgcca acgccatcca gcagcaaact gtcgtggtgc cggcatccag 1920 
cctggccaat gccaaactcg tgccaaagac tgtgcacctt gccaacctta accttttgcc 1980 
tcagggtgcc caggccacct ctgaactccg ccaagtgcta accaaacctc agcaacaaat 2040 
aaagcaggca ataatcaatg cagcagcctc gcaacccccc aaaaaggtgt ctcgagtcca 2100 
ggtggtgtcg tccttgcaga gttctgtggt ggaagctttc aacaaggtgc tgagcagtgt 2160 
caatccagtc cctgtttaca tcccaaacct cagtcctccc gccaatgcag ggatcacgtt 2220 
accgacgcgt gggtacaagt gcttggagtg tggggactcc tttgcacttg aaaagagtct 2280 
gacccagcac tacgacagac ggagcgtgcg catcgaagta acgtgcaacc attgtacaaa 2340 
gaacctcgtt ttttacaaca aatgcagcct cctttcccat gcccgtgggc ataaggagaa 2400 
aggggtggta atgcaatgct cccacttaat tttaagccag tcccagcaga tcaaat 2456 

<210> 43 

<211> 738 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:1076828 . 1 :2000FEB18 
<220> 

<221> unsure 

<222> 74, 161, 279, 339, 346, 362 
<223> a, t, c, g, or other 

<400> 43 

actcctaagc ctcaagtgat ctccttattg gaacaaggga aagagccctg gatggttggc 60 
agagagctta caanaggcct gtgttcagat ctggaatcga tgtgtgaaac caagttatta 120 
tctctaaaga aggaagttta tgaaatagaa ttatgccaga nggagataat gggacttaca 180 
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aagcatggcc ttgagtactc cagttttgga gatgttttgg aatatagaag ccaccttgca 240 
aaacaactgg gatatccaaa tgggcatttt agtcaagana tattcactcc tgaatacatg 300 
cccacattta ttcaacagac attccttact ctccatcana taattnataa tgaagacaga 360 
cnctatgaat gtaaggaatg tggaaagatg tttagtcatg gctcacaact tactcaacat 420 
cagagaatac acactggtga gaaaccctat caatgtaagg aatgtggaaa agcgtttaat 480 
cgtggctcac tccttacacg acaccagagg attcatactg gtgagaaacc ctatgaatgt 540 
aaagaatgtg gaaaaacctt tagtcgtggc tcagaactta ctcaacatga gcgaattcac 600 
acagctggag ctccgttact ctcatgggga ctttgagaat ggtatctggg tgttatggat 660 
catggagcag aaatacagaa ggagtctgca accctgatga ctttgtggaa cagccatact 720 
agctctggtc tgcttgcc 738 

<210> 44 

<211> 310 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:1076931 . 1 :2000FEB18 
<400> 44 

gggcggtgaa ggtcagagtg cagacctgag accactgctc accgacttca gactccagtt 60 
tatctgtgcc ccagcttcct cacttagtct cagaagactt aggctgaggc ctcagaagga 120 
gatctccatc ctttgtccag agcagaacag gatggccatg tcccaggaat cattgacctt 180 
caaggacgtg ttfcgtgggct tcaccctgga ggagtggcag caactggacc ctagtcagag 240 
ggctctgtat agagatgtga tgctggagaa ctacagcaac cttgtctcag tggggtattg 300 
tgctcacaaa 310 

<210> 45 

<211> 404 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1078121 . 1 : 2000FEB18 
<400> 45 

tgctcgggga agctcaggtg gccgccacag cctcttgccc tctgacctgc aggtatcagg 60 
agatccatag ctaagcagcc tggacacccg ggaagctggg aaatggtgtg atttgtccag 120 
ataatctcac ctgagaagga atcccagaga agaaggaaga agaggaagaa atggctggtt 180 
ctcagggact gttgatattc agggatgtgg ccatagaatt ctctccggag gagtggagct 240 
atctggaccc tgctcagcag aatctgtata gggacgtgat gttagaaaac tacagaaacc 300 
tggtctccct gggtattgct gtctctaagc cagaactgat cacctgtcta gagcaaagga 360 
atgagccctg gaatgtgaag aaacattgag acagtagcca gaca 404 

<210> 46 

<211> 1179 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1079203 . 1 : 2000FEB18 
<400> 46 

gggagctcta ggtctagtct ttactgctct gtgtattctg ctcctagagg cccagcctct 60 
gtgactccgt tatctggcag gtattgggag atgcacagct aagatgccag gaccacctgg 120 
aagcctagaa atgttatata ttctcatttc actgaagacc tttggccaga gcatagcata 180 
aaagattctt ttcaaaaagt gatactgaga ggatatggaa aatgtggaca tgagaattta 240 
caattaagaa taagttgtaa aagtgtggat gagtctaagg tgttcaaaga aggttataat 300 
gaacttaacc aatgtttgag aactacccag agcaaaatat ttcaatgtga taaatacgtg 360 
aaagtctttc ataaattttc aaattcaaac agtcataaga aaagaaatac tggaaagaag 420 
gttttcaaat gtaaagaatg tggcaaatca ttttgcatgc tttcacatct aacacaacat 480 
ataagaattc acactagaga gaattcttac aaatgtaaag aatgtggcaa agttcttaac 540 
caatcctcag agcttattaa acataagaaa attcatactg gagagaaacc ctacacgtgt 600 
gaaaaatgtg gcaaaacctt taaccagtct gcaaatcttt atgcacataa gaaaattcat 660 
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actggagata aaaccataca agtgtaaaga atgtggcaaa gcctttaagt catactacag 720 
cattcttaaa cataagagaa ctcataccag gggaatgtct tacgaaggtg acgaatgtcg 780 
cggtctttaa atgctcctca atcctttcta atcataagat aattcataat gaagagaaac 840 
tctaaaatgt gaaaaatgtg agaaggcttt taatcacacc tcaatctgtt gtagacataa 900 
gagtaattat attgatgaga agccaaatta aaattaaaaa tgtggcaaag acttccaatt 960 
ctagtcagtt cttaatgtaa ttcctactga agaaaacacc tggaaataca aaaaatgtgg 1020 
caaaacttgt aaccaatgct tagcattttt gcacatgata gtatttatat gtgagactac 1080 
ttgtacaaac ataaaaatat acaaaagcta ttgtctactt gaaatttata gttaataaaa 1140 
gcaaataaat gtatttctgt tgaaagacct ttagaaata 1179 

<210> 47 

<211> 557 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1082586 . 1 : 2000FEB18 - 
<220> 

<221> unsure 
<222> 42 

<223> a, t, c, g, or other 



<4Q0> 47 

gaaaggaaaa agattgaaga aaagtaagca 
gcagaccaat atatacatcg taggagttcc 
ttggaagaaa taatggctga aaacttttca 
caagaagctc aacaaactcc aagtatgatg 
ataatcaaac tgtcaaaaga caaacacagg 
agtcttgtgt ttacttctct gtttcttctg 
gacctgcagg tattgggaga tccactgcta 
tgagatcatt gacatttagg gaggtggcca 
tggacactgc acagcagaat ttatatagga 
tcttcctggg tatattg 



cagcctagga gncccatggg acaatatcaa 60 
agcagaagag agaaaggggc agagagaata 120 
agtttgataa aagacatgaa tataaacatc 180 
aactcaaaga tagccacatt gagacatatt 240 
cctttgtctc tcactgcagc cagagctccc 3 00 
cttctagagg cccagcctct gtggccctgt 360 
agatgccagg accccctgga agcctagaaa 420 
tagaattatc tctggaggaa tggcaatgcc 480 
atgtgatgtt agggaactac agaaacctgg 540 

557 



<210> 48 

<211> 353 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1082774 . 1 :2000FEB18 



<400> 48 

cagagatccg agctatcgat cttaactgct ctgtgccgtc agctcctaga ggcccagcct 60 

ctgtggccct gtgacctgca ggtattagga gggtcacagc taggacgccg ggaccccctg 120 

gaagcctaga aatgggacca ttgcaattta gagatgtggc catagaattc tctctggagg 180 

agtggcattg cctggacgct gcacagcgga atctatatag ggatgtgatg ttagagaact 240 

acagaaacct gatcttcctt ggtattgttg tctctaaacc aaacctgatc acctgtctgg 300 

agcaaggaaa aaaaccttta actatgaaga ggcatgagat gattgccaaa ccc 353 

<210> 49 

<211> 1027 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1082775 . 1 :2000FEB18 
<400> 49 

ctctgctctg ccaacactag tggaatatga tcacatccca gggatcagtg tcatttaggg 60 
atgtgactat gggcttcact caagaggagt ggcatcatct ggaccctgct cagaggaccc 120 
tgtagaggaa tgtgatgctg gagaactaca gccaccttgt ctcagtaggg tattgcattc 180 
ctaaaccaga agtgatcctc aaattggaga caggcaagga gccatggata ttagaggaaa 240 
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aatttcgaag ccagagtcat ctgggtgagt tagtgccaga tggaatttaa agaattaatt 300 
aataccagta gaaactattc aagaatgaag ttcaatgagt ttaacaaagg tggaaaatgt 360 
ttctgtgatg aaagcatgaa ataattcatt ttgaagagga accttctgaa tataataaca 420 
atgggaacag cttctggctg aatgaagacc tcatttggca tcagaagatt aaaaattggg 480 
agcaaccttt tgaatacaat gaatgtggga aagctttccc tgagaattca ctcttccttg 540 
tacataagag agcttacaca ggacagaaaa catgcaaata tactgaacat gggaaaacct 600 
gttatatgtc attttttatt actcatcagc aaacacatcc aagagagaac cactatgaat 660 
gtaatgaatg tggagaaagt atctttgagg aatccattct ctttgaacat cagaatgttt 720 
acccattcag ccagaattta aatcccactc taattcagag aacccactca attagcaata 780 
ttattgaata taatgaatgt ggaacctttt tcagtgaaaa attagccctt catttacaac 840 
agagaacaca tccaggggaa aaaccttatg aatgtcatga atgtggaaaa accttcaccc 900 
agaagtcagc ccacacaaga catcagagaa cacacacagg gaaaacccta tgaatgtcac 960 
ggatgtggga agaccttgta taagaattca gacctcatta gaaatcaaag aattcacaca 1020 
gggagtt 1027 

<210> 50 
<211> 364 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1083120 . 1 :2000FEB18 
<400> 50 

cgccaggcct acgagatttg acttgtaaag aattacttat attgactgag agagaagccc 60 
agaagaggaa gaagaggaaa gaaaaggagt cagggatggc tcttacacag ggacctttga 120 
cattcaggga tgtagccata gaattctctc aggaggagtg gaaatccctg gaccctgtgc 180 
agaaagcttt gtactgggat gtgatgttgg agaactacag gaacctggtc ttcctgggta 240 
aggataattt tgctctagaa gttaagatct gccctcgtgt atttttgtat tttctctgtt 3 00 
gtctttctgt gggagcccgt tccattcatt tgcactgaga ctgaagcctt gttgagacac 360 
aaat ^ ~ - 3 ^ 



<210> 51 

<211> 901 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 10877 07 . 1 :2000PEB18 



<400> 51 

cctagcttct aggctctgag tccagtaccc 
tagctcctcg gtgactccac catagcccct 
tccataggga agaaggcgga acatccggag 
atgtggctgt agaattctcc ccagaagagt 
tgtataggga tgtgatg.tta gagaactacg 
ttaagccaga cttgattacc tatctggagc 
agaagacagt agccaaacac ccagattatt 
attggagcta tatgtttctg aattctgaac 
tctgttgaac atatgacttc tctcgtctgc 
acatggattt aaatatatag atatatcact 
tagagaaagt aatcatattt gtttatggtt 
attagaatga acagataatt ttacaagtgt 
gagattttga aagcaaatct attttctctg 
tattgctcct ggcttagaat catcttgtgc 
gcctgttgct caccatagac ataattttct 
c 

<210> 52* 

<211> 254 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 



gtctgtacta ttccatctct tccgctccat 60 
gttatcctgt gacctgcagg tactgggaca 120 
gctgggaaat gcgagtgcta acattcaggg 180 
gggaatgcct ggactctgct cagcagcgtt 240 
gaaacctgtt ctccttgggt cttgctatct 3 00 
aaagaaaaga gccttggaat gcaaggagac 3 60 
atgatgtttg taatgaagat tatgagtata 420 
aactatttat aaaattttat cctacttttt 480 
taaacacata cagaccttta gttttggttt 540 
gtaaaataaa cttcaggtgt aacagattta 600 
gtgtacctac tttgagaaga aaagaaaaat 660 
tgatcactta ccagcaaacc agaaacttca 720 
ctgtgtatta aattcattta tctaaaatgt 780 
aaattctctt tttttgttgt ttgtctgttt 840 
tttcataaaa cattctttgt ataatcacct 900 

901 
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<223> Incyte ID No: LG: 1090915 . 1 : 2000FEB18 



<400> 52 

ttccgttttc gcgtggttct 
tattggaacc ccaccaagta 
ttcaggaatt agtgacattc 
gcctggaccc taatcagagg 
tggtctccct ggga 



tttgcaagct ctggattctc 
ggactgatca ggtcttacaa 
agggatgtgg ccatagactt 
gacttataca gggatgtgat 



tggagtttga atgtttccag 60 
ttctaaaacc atgacctgtt 120 
ctctcggcag gagtgggaat 180 
gttggagaac tacaggaacc 240 

254 



<210> 53 

<211> 597 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:1094230 .1:2000PEB18 
<400> 53 

tgcagtcaga gctccaggtc tggttcttct cctaaaggcc caggctgtgt ggccccgtgt 60 

cc tgcaggta ttgggagatc cacagctaag acaccgggac ctcctggaag ccaaaaatgg 120 

gaccattgca atttagagat gtggccatag aattctctct ggaggagtgg cattgcctgg 180 

acactgcaca gcggaattta tataggaatg tgatgttaga gaactacagt aacctggtct 240 

tccttggtat tactgtttct aagccagacc tgatcacctg tctggagcaa ggaagaaaac 300 

ctttgaceat gaagagaaat gagatgatag ccaaaccctc agtgagcttc cttcaagttc 360 

acagtgagag ccaaagtcct cttcatgaca tataagagac tgcacagtgt ggctgctttt 420 

ccataatttt tgggacgcac aaatgtctgc atgattttga gaaactaaaa ttactctcaa 480 

agttctcttt ttgcatcaga tctgaaatgt ctgagagtaa tagtttctgt tgaatttttt 540 

tttgttcatt tttctgcaca gtccattctg tttttattac tatctaggct tgaaata 597 

<210> 54 

<211> 637 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:474848 .3 :2000FEB18 
<400> 54 

ggaagtgcgg cacccagagg cggtcttgta gctgggccag cctggggctt ggttctatgt 60 
ccctgcgggt cggtgcgagg gcgaagagga acccgtgggc ccgggggatc ccggggggcc 120 
ggaccagtgt tccccagttg tgggagcaga cgcgtgggcg catcgcgggc gggcagggcc 180 
tgaagtgcag ctatgtttcc agtgttctct ggctgtttcc aagagctaca agaaaagaat 240 
aaatctctgg agttggtgtc ctttgaggag gtagctgtgc acttcacctg ggaggagtgg 300 
caggacctgg atgacgctca gaggaccctg tacagggacg tgatgctgga gacctacagc 360 
agcctggtat cattggggca ttgcattacc aaacctgaga tgatcttcaa gctagagcaa 420 
ggagcagagc catggatagt agaagaaacc ctaaacctga gactttcagg ctaagcctcc 480 
ctccttttgg tagaggcttt tttataccat ttgggaactg gtttggataa gctaccttaa 540 
tgaaggacga cacatccctc cagagatgat aaaagacttt ttgtcttttc tgatttaaga 600 
aaactttatc tcgtaagcta cctatagttt acaaaaa 637 

<210> 55 
<211> 754 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 251656. 1:2000FEB01 
<220> 

<221> unsure 
<222> 702 

<223> a, t, c, g, or other 
<400> 55 
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tgaaaatggg gagaactgta atcaagacat gtttgagaat gaatcacgta agatattctc 60 

ggaaatgcct gaaggtgaaa gtgctcagca ctccgatggg gaaagtgact ttgagagaga 120 

tgctggcatc cagaggctcc agggacacac cccaggtgag gaccacgggg aggtggtttc 180 

tcaggacagg gaagttggcc agctcatagg cctgcagggc acctacctag gggagaagcc 240 

ctacgaatgt ccccagtgtg ggaagacctt cagccggaaa tcccacctca tcacacacga 300 

gaggacccac acaggagaga aatactacaa atgtgatgaa tgtggaaaaa gctttagtga 360 

tggttcaaat tttagtagac accaaaccac tcacaccggg gagaagccct acaaatgcag 420 

agactgtggg aagagcttta gccggagtgc caacctcata acccaccaga ggatccacac 480 

gggggaaaag cccttccagt gtgccgagtg tggcaagagc ttcagcagga gtcccaacct 540 

cattgcacat cagcgcaccc acacaggaga gaaaccctac tcgtgccccg agtgtggaaa 600 

gagctttggc aaccgatcca gccttaacac gcatcagggg atccacactg gagaaaagcc 660 

ctacgaatgt aaagaatgcg gcgaaagttt tagttacaac tncaatctaa tcagacaaca 720 

gagaattcac acaggagaga aaccctacaa atgt 754 

<210> 56 

<211> 1601 

<212> DNA 

<213> Homo sapiens 

<220> ' : 
<221> misc_feature 

<223> Incyte ID No: LI : 021371 . 1 :2000FEB01 
<400> 56 

acgtctgtcc tggtgtytca gtggctggct tgtgacctgg catgtccgtg gcaggtgtgg 60 
aaggagagcc tctagtgagt tcccagagtg gacagagccc tccagagcca caggatcccg 120 
aggctcccag ctcctcaggc ccaggacacc tggtggccat gggcaaggtg agcaggaccc 180 
ctgtggaagc tggtgtgagc cagtcagatg cagagaacgc agccccctcc tgcccggatg 240 
agcatgacac tctgccccgg cgccgaggtc gaccttccag gcgcttccta ggcaagaaat 300 
accgcaagta ctattacaag tcgcccaaac cacttttgag gcccttcctg tgccgcatct 360 
gtggttctcg ctttctgtcc cacgaggacc tgcgcttcca cgtcaactcc catgaggctg 420 
gcgatcccca gctcttcaag tgcctgcagt gcagctatcg ttcccgccgc tggtcctcgc 480 
tcaaggagca catgttcaac cacgtgggca gcaagcccta caagtgtgac gagtgcagct 540 
acaccagtgt ctaccggaag gacgtcattc ggcacgccgc tgtgcacagc cgggaccgga 600 
agaagaggcc agatccgact ccaaagctga gctctttccc ctgccctgtg tgtggccgtg 660 
tgtaccccat gcagaaaaga ctcacgcagc acatgaagac gcacagcact gagaagcccc 720 
acatgtgtga caagtgtgga aagtccttta agaagcgcta caccttcaaa atgcacctgc 780 
tcacgcacat ccaggctgtt gccaaccgca ggttcaagtg tgagttctgt gagtttgttt 840 
gtgaagacaa gaaggcactg ctgaaccacc agttgtccca cgtcagtgac aagcccttca 900 
aatgcagctt ttgtccctac cgcaccttcc gagaggactt cttgctgtcc catgtggctg 960 
tcaagcacac aggggccaag cccttcgcct gtgagtactg ccacttcagc acacggcaca 1020 
agaagaacct gcgcctgcac gtacggtgcc gacacgcaag cagcttcgag gaatggggga 1080 
ggcgccaccc tgaggagccc ccctcccgcc gtcgcccctt cttctctctg cagcagattg 1140 
aggagctgaa gcagcagcac agtgcggccc ctggaccacc tcccagttcc ccaggacctc 1200 
ctgagatacc cccagaggcg acaactttcc agtcatctga ggctccctca ttgctctgtt 1260 
ctgacaccct gggcggcgcc accatcatct accagcaagg agctgaggag tcgacagcga 1320 
tggccacgca gacagccttg gatcttctgc tgaacatgag tgctcagcgg ggaccggggg 1380 
gcacagccct gcaggtgtgc tgcctgggaa cctgctcccc ttcccagcta ccgcagtacc 1440 
cagccctgca ctggacactg ggactagagg aaaacagtgt atcagagctt ctccgccctt 1500 
gggggctccc aggcagtgga ggagacagat ctgctgaagt ctggtgggca aacagagagg 1560 
aacaggcttt gccacggagg ccgcagggta tcccatccat t 1601 

<210> 57 
<211> 1059 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<223> Incyte ID No: LI:133095.1:2000FEB01 
<220> 

<221> unsure 
<222> 107 

<223> a, t, c, g, or other 
<400> 57 
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tggactgggc gaggaggtgc cctgtgccat gatggagggt gtggcagcct acacccagac 60 
agagcccgag ggtagccagc ctagcaccat ggacgccacc gcagtancag gcatcgagac 120 
caagaaagag aaggaggacc tgtgcttgct aaagaaggag gagaaggagg agccagtagc 180 
cccggagctg gcaacaacgg tgcctgagag cgcagagcct gaggcagagg cggacgggga 240 
ggagctggac ggcagcgaca tgtcagccat catctatgaa atccccaagg agcctgagaa 300 
gaggcggcgg agcaagcggt cgcgggtgat ggatgctgac ggcctgctcg agatgttcca 360 
ctgcccatac gagggctgca gccaagtcta cgtggccctc agcagcttcc agaaccacgt 420 
caatcttgtg catcggaaag gaaagaccaa agtgtgccct catcctggct gtggcaagaa 480 
gttctattta tccaaccacc tgcggcggca catgatcatc cattcaggtg tccgtgaatt 540 
cacctgcgag acctgcggca agtccttcaa gaggaagaac cacctggagg tacatcggcg 600 
cacccacacc ggcgagaccc ccctgcagtg cgagatctgt ggctaccagt gccggcagcg 660 
cgcgtcgctc aactggcaca tgaagaagca cactgcggag gtgcagtaca acttcacgtg 720 
cgatcgctgc gggaagcgct tcgagaagct ggacagcgtc aagttccaca cgctcaaaag 780 
ccacccggat cacaagccca cctgacccac ctgaccactg accgccccta tttattcgtc 840 
cgctcggaca ccacgcccgg gcttgccggg gcctggacag ctgcgagggc cgcccggacc 900 
gcgggccgga aggaggcgcc cccgccccgc cccagagctg ggccccctgg gcaggttccc 960 
caccccgccc caccgcatcc ttctcggagc tggtgcctgg ggctgcattg ctggaactgt 1020 
gtcaagagag cagagtgaga ttaaagagcg agaaaggag 1059 

<210> 58 

<211> 1187 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :236654 . 2 :2000FEB01 
<400> 58 

gcggccgctg cctgcagacc ttcccgttgg aggccatcac tgccttcatg gaccacaaga 60 

agctgggctg tcagctcttc agaggcccca gccgcggcca gggctcagaa cgagaggagc 120 
tgaaggcctt gagctgcctg cgctgtggca aacagttcac agtggcctgg aagctgctgc^lSO 

gtcacgccca gtgggaccac ggactgtcca tctaccagac agaatcagag gccccggagg 240 • 

ccccgtcctg ggcctggccg aggtggctgc agccgtgtcg gcagtggtgg ggccagcagc 300 

tgaggccaag agcccccgtg caagtggcag cggcctcacc cggcggagcc cacctgtcct 360 

gtgtgcaaga agaccctcag ctccttcagc aacctcaaag tgcacatgcg ctcacacaca 420 

ggcgagcggc cctatgcttg cgaccagtgt ccctacgcct gcgcccagag cagcaagctc 480 

aaccgccaca agaagaccca ccggcaggtg ccgccccaga gccccctcat ggccgacacc 540 

agccaggagc aggcctctgc agcccctccg gagccggctg tccatgctgc tgcccccacc 600 

agcacccttc catgcagcgg tggtgagggg gctggagccg ccgccacagc aggtgtccag 660 

gaacccgggg ctcctggcag tggggctcaa gccggccctg gtggagacac ttggggagcc 720 

atcaccacgg aacaaagaac tgaccctgca aacagccaga aggcatcacc caaaaagatg 780 

cccaagtcag ggggcaagag ccgcgggccc gggggcagct gtgagttctg cgggaagcat 840 

tttaccaaca gcagcaacct gacggtgcac cggcgctcac acaccgggga gcgcccctac 900 

acctgtgagt tctgcaacta cgcctgcgcc cagagcagta agctcaaccg ccaccgccgc 960 

atgcacggca tgacgcctgg cagcacccgc ttcgagtgcc cccactgcca tgtgcccttc 1020 

ggcctgcgag ccaccctgga caaacacctg cggcagaagc accctgaggc ggccggcgag 1080 

gcctgagccc aggaaagccc ccctcactgt ccctggtacc gctgccaaca cccattgacc 1140 

tcctcgtttt tgcccgcctt ctccaagtaa attttccctt ttattta 1187 

<210> 59 

<211> 1671 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 200009 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 1663 

<223> a, t, c, g, or other 
<400> 59 

gcggccgccg cgcccccgcc ggagctgccg gagtggctgc gggacctgcc tcgcgaggtg 60 
tgcctctgca ccagtactgt gcccggcctg gcctacggca tctgcgcggc gcagaggatc 120 
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cagcaaggca cctggattgg acctttccaa ggcgtgcttc tgcccccaga gaaggtgcag 180 
gcaggcgccg tgaggaacac gcagcatctc tgggagtaaa tgtcccttca acggtaatgg 240 
aagccatgtt gccgaccaga agcccttgag gcctttcacc aaaggcggca aatctcgccc 300 
ctaccaccca gcagcgctcc gttgttttcc cccagactcc gtgcagcagg aacttctctc 360 
ttctggataa gtctgcjgccc attgaatcag gatttaatca aatcaacgtg aaaaaccagc 420 
gagtcctggc aagcccaact tccacaagcc agctccactc ggagttcagt gactggcatc 480 
tttggaaatg tgggcagtgc tttaagactt tcacccagcg gatcctctta cagatgcacg 540 
tgtgcacgca gaaccccgac agaccctacc aatgcggcca ctgctcccag tccttttccc 600 
agccttcaga actgaggaac cacgtggtca ctcactctag tgaccggcct ttcaagtgcg 660 
gctactgtgg tcgtgccttt gccggggcca ccaccctcaa caaccacatc cgaacccaca 720 
ctggagaaaa gcccttcaag tgcgagaggt gtgagaggag cttcacgcag gccacccagc 780 
tgagccgaca ccagcggatg cccaatgagt gcaagccaat aactgagagc ccagaatcaa 840 
tcgaagtgga ttaacggatt gatggttgga attaaactgc aaggaaagtc atgattaaat 900 
gtcacggaca cttaagcaaa accaaagatt tcctctgagc aactttcaat cagtcccaga 960 
aaaccaaaag cagtaataaa ataagtaaga tgttaagaga tattgatcct ggcatggaag 1020 
tcagaccagg aaagagatta tttatttatg acttagggat gagacttatt tcagtggaca 1080 
actaacctgg gatggttaac atttccagtc ccaccatgta ttttgctttg tttctaaaaa 1140 
gctttttaaa aactgttatt taataccaaa gggaggaatc gtatgggttc ttctgcccac 1200 
cgttgtgact aagaatgcac agggacttgg ttctcgttgc accttttttt agtaacatgt 1260 
ttcatgggga cccactgtac agcccttcat tctgctgtgt cagtttggcc tggcctgaca 1320 
ctggctgccc cagcggggac cacggaagca gagtgagagc cttcgctgag tcaatgctac 1380 
cttcagcccc agacgcatcc catttccatg tcttccatgc tcactgctca tgcacttttt 1440 
acacggtttc ttccaaacag cccggtcttg atgcaggaga gtctggaaaa ggaagaaaat 1500 
ggtttcagtt tcaaaattca aaggaaaaag ttgaggactt attttgtcct gtcaagattg 1560 
eaagaaeatg taaaatgtac ggagcttcat aatacgttat attgttccga agcagctcgt 1620 
tgagaaacat ttgttttcaa taacatttta gcttaaaaaa aanaaaggaa a 1671 

<210> 60 

<211> 952 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID.No: LI : 758502 . 1 :2000FEB01 
<400> 60 

caacactcat caaacatcag agaacccaca caggggagag accctatgag tgcccagagt 60 

gtggaaagac ttttgggcgg aagccacacc tcataatgca ccaaagaacc cacacaggcg 120 

agaagcccta cgcgtgcctg gaatgtcaca aaagcttcag tcgaagctca aatttcatca 180 

ctcaccagag gacccacaca ggggtgaagc cttacaggtg taatgactgt ggggagagtt 240 

ttagccagag ctcggatttg attaagcacc aacgaaccca cacgggagaa cggcccttca 300 

aatgcccgga gtgcgggaag ggcttcagag atagttctca ttttgtagct cacatgagca 360 

ctcattcagg agagaggcct ttcagttgtc ctgactgcca caaaagcttc agtcagagct 420 

cacatttggt cacgcaccaa agaacacaca caggtgagag accttttaag tgcgaaaact 480 

gtgggaaagg attcgccgac agctccgccc tcactaagca ccaacgaatc cacaccggag 540 

aaagacccta caaatgtgga gagtgtggga agagcttcaa tcagagctcc cactttatta 600 

cccatcagcg aatccactta ggagacaggc cctatcgatg tcctgagtgt ggcaagacct 660 

tcaatcagcg ttcccatttc ctcacacacc agagaacgca tacaggagaa aaacctttcc 720 

actgtagtaa atgtaacaag agcttccgtc agaaagcgca tcttttatgc catcaagaca 780 

cccatttgat ttaggaagta gtctttggtg ttcagctgct cccttgcaca ttttcattgc 840 

tactgtcttc aagcacccca aatagagaaa acctgggcgt cagtggctca atttgggccc 900 

tgatctattc tccctctttc ttgtctatgt tataacagag aggatagact ta 952 

<210> 61 

<211> 749 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 344772. 1:2000PEB01 
<220> 

<221> unsure 
<222> 713 

<223> a, t, c, g, or other 
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<400> 61 

cgcaaaaggc tgtccctccc aaacctggga 
agccctacct gcactgctct ggtctagtac 
attcctagga tgacctggga atttatcccc 
gaagctgagc tgtcttcaac tgtattatat 
tgaaaaatga gtcagacctt tccactgtaa 
ctttaaaatg gacatagaag actgcaatgg 
atctctggag aaggagttcc tcggggcccc 
ccagcactct tctcctagcc gctcactcag 
cgatgaggag tcaagcagac tgctggggcc 
cgtgattgtg gaagattcat tgtctgagcc 
gcctcactcc cctgggggca tccggctgcc 
catggtctgt attggaccca acgtgctcat 
ggcccttcca ttgcaaccag tgtggtgcc 

<210> 62 
<211> 620 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LI: 789445. 1:2000FEB01 

ggaacatcga gaagccaagg ctagtggctg ggttactgat ggattactga tggattcttc 60 
tcagcactta gtgacctttg aggatgtggc tgtggacttt acccaggagg agtggacttt 120 
gttggatcaa gcccagagag atctctacag agatgtgatg ttggagaact acaagaatct 180 
cattatacta gcagggtctg aattattcaa acgtagtctc atgtctggat tggaacaaat 240 
ggaagagctg aggacaggag tgacaggagt tctgcaggaa ttggatttgc aactcaaaac 300 
caaaggctcc ccactgctgc aagatatttc tgcagaaaga tcaccaaatg gagtacaatt 360 
ggagagaagc aatactgcag agaaactgta tgactctaac cattctggaa aagtcttcaa 420 
tgaacaccca tttcttatga ctcacatgat aactcacatt ggagagaaaa cttctgagga 480 
taatcagagt ggataagcct taagaaagaa ctttcctcat agtttttaca agaaaagtca 540 
tgctgaggga aaaatgccta agtgtgttaa acatgaaaaa gccttcaacc agtttccaaa 600 
tcttactagg cagaataaaa 620 

<210> 63 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_featiire 

<223> Incyte ID No: LI : 789657 . 1 :2000FEB01 
<220> 

<221> unsure 

<222> 449, 530 

<223> a, t, c, g, or other 

<400> 63 

ttagcagtgt aatgaacgtg gcaaaggttt taaatcaaaa agcaaagctt gcacatcagc 60 
atagaattca tactggagat aaacgttaca aatgtgaagc atgtgacaaa gtttacagtc 120 
gcaaatcaag cctcgaaaga caggagaatt catactggag agaaagctta catatgtgaa 180 
gaatgtcacc aagttttcag tcacactcaa accttgaaag acacagcaga attcctactg 240 
gagagatagc ataaaaatgt aagagtttgt gacaaggctt tcaggcataa ttcgcacctg 300 
gcacaacatc ctagaattca cattggagag aaagcttaca agtataatga atgtgacagg 360 
tctttagtgg gcagtcaaca cttgtttacc atcaggcaat ccatggtgta gggaaacttt 420 
acttatgtaa tgattgtcac aaagtcttna gttacactac aaccattgcg aatcattgga 480 
gaatccataa tgaataaaga tctaacaagt gtaatgaatg tggcaagtcn ttagtaaaag 540 
ttcccacctt gcagttcatc agagaattca tactggagag aaaccttaca aatgtaatcg 600 
atgtggcaag tgcttcagtc aaagttcctc tcttgcaagg catcagacag ttcatacagg 660 
agagaaacct tacatatgtg ctgaatgtgg aaaggccttt tctcagaagt cagaccttgt 720 
tgtacatcag ataattcata ctggagagaa acctgatcga tgtactgtat gtgggaaggc 780 
cttcatccag aagtcccaac tcactgtaca tcagagaatt catacactaa tgaaatcata 840 
agaatggtct gaacacagaa aagccttcag ggtcagttca agccttaata gatagtgcaa 900 



ttctgggctc actgagttca cctgcgagtc 60 
aaacaggctg ctggcattga gagactggcc 120 
tctgcactgc tccccaagca cttggataat 180 
gtagacagaa ggtcaccatc tacagcctgg 240 
agacaaacac atgagctgag ctaagaatct 300 
ccgctcctat gtgtctggta gcggggactc 360 
agtggggccc tcggtgagca cccccaacag 420 
tgccaactcc atcaaggtgg agatgtacag 480 
agatgagcgg ctcctggaaa aggacgacag 540 
cctgggctac tgtgatggga gtgggccaga 600 
caatggcaag ctcaagtgtg acgtctgcgg 660 
ggtgcacaag cgcagtcaca ctnggtgaaa 720 

749 
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caaccaatgg atttgatgat tttggggact acatctttgt tgataaaatt ttacaagtga 960 

agtcatgttc ctaatgtatt tcattcttta tcaaagataa tagagaagtc aatacgtaaa 1020 

tgatggacat tttcactatg gcatataaaa gtttttaaat tgagaaatga atgattagca 1080 

taacagaacg aattgcatgt acatctcttt 1110 

<210> 64 

<211> 899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:789808. 1 :2000FEB01 
<400> 64 

gaaatcaaga attcacacag gggagagcct ttacagatgt catgaatgga gaaatccttc 60 
agtgaaaagt catcccttac tcaaaatcag agaacacacg tggggagaaa tcatgaatgt 120 
catgaatgtg ggaaaacctc gtttaagtca gttctaactg tgcatcagaa aacacacagg 180 
ggagaagccc tatgaatgct atgcatgtgg caacaccttt ctcagaaaat ccgacctcat 240 
taaacatcag agaattcaca caggagaaaa accttatgaa tgtaatgaat gtgggaagtc 300 
attctccgag aagtcaaccc ttactaaaca tctaagaaca cacagatggg aaatcttatg 3 60 
catgtattca atgtggaaaa tttttctgct gctactacag tttcacagaa catctgagaa 420 
gacacacagg ggagaaacct tttggatgta atgaatgtgg gaaaaccttc catcagaagt 480 
tggccctaat tgttcaccag agaactcata taagacagaa accctatgga tgtaatgaat 540 
gtggaaaatc attctgtgtg aagtcaaaac tcattgcaca tcatagaaca tacacagggg 600 
agaaacccta tgaatgtaat gtttgtggaa aattattatt aagtcaaaac taactgtaca 660 
tcacagaaca cacttgaggt gaaaccctat aaatgtagta agtgagggaa attactctgg 720 
gtgaagtcag aactttgtag agcagagaac ataaagggtg agagaaatct gttaatataa 780 
tgataatgag aacacctttg ccctgaagtc agttctcaca gtagagaaga gaacttaaag 840 
agggaaaaaa caatatgaag atatggaatg caggaaaaca ttattctggg atttgggcc 899 

<210> 65 

<211> 476 

<212> DNA. 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 792919 . 1 :2000FEB01 
<400> 65 

catcagatga ttcatatggg acagaaccca tataattgta aagaatgtgg gaagagcttc 60 
aaatggtcct catatctttt ggtccatcaa cgagtccaca ctggagaaaa gccatacaaa 120 
tgtgaggagt gtgggaaggg ctacattagt aagtcaggtc ttgacttcca ccatagaacc 180 
cacacgggag agagatctta taactgtgat aactgcggga agagctttag acatgcttct 240 
agtattttga atcataagaa actccactgc caaagaaagc cattgaaatg tgaagactgt 300 
ggaaagaggc ttgtatgccg gtcatactgt aaagaccaac aaagagacca cagtggagaa 360 
aacccatcca aatgtgagga ctgtgggaag cgctacaaga ggcgcttgaa tctggatata 420 
attttatcat tatttttaaa tgacatataa gttatacata tttatggagt gtgaaa 476 

<210> 66 

<211> 597 4 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 793949 . 1 : 2000FEB01 
<400> 66 

tgcagtcaga gctccaggtc tggttcttct cctaaaggcc caggctgtgt ggccccgtgt 60 
cctgcaggta ttgggagatc cacagctaag acaccgggac ctcctggaag ccaaaaatgg 120 
gaccattgca atttagagat gtggccatag acttctccca ggaggagtgg cattgcctgg 180 
acactgctca aagagatttg tacagatgtg taatgttgga gaactacagc aacctggtct 240 
tccttggtat tactgtttct aaaccagatg tgatttcctc attggagcaa ggaagaaaac 300 
ctttgaccat gaagagaaat gagatgatag ccaaaccctc agtgagcttc cttcaagttc 360 
acagtgagag ccaaagtcct cttcatgaca tataagagac tgcacagtgt ggctgctttt 420 
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ccataatttt tgggacgcac aaatgtctgc atgattttga gaaactaaaa ttactctcaa 480 
agttctcttt ttgcatcaga tctgaaatgt ctgagagtaa tagtttctgt tgaatttttt 540 
tttgttcatt tttctgcaca gtccattctg tttttattac tatctaggct tgaaata 597 

<210> 67 
, <211> 523 
<212> DNA 
<213> Homo sapiens 

<220> i 
<221> misc_feature 

<223> Incyte ID No: LI : 794389 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 74 

<223> a, t, c, g, or other 



<400> 67 

tgtaaggact 

atactggtga 

ctcttgttac 

gaaaagcctt 

aaccctatac 

ataafeagaat 

gctgcagctc 

caaaactcaa 

tctttttaaa 



gcaaaagacc 
aaanccctat 
tcataagaga 
cagtgcgcac 
atgccatgcc 

tC3t3Cty y t 

tcaccttact 
aatgttaatt 
tatgctctgt 



ttttgtcgtg 
gaatgcaagg 
acacacagtg 
tcttcccttg 
tgtgggaagg 
gaaaaaccct 
cgacactgta 
ctgattattc 
tagtttgtat 



tgatgcagtt 
aatgtggaaa 
gagaaaaacc 
ttactcataa 
cctttaatac 
ttcagtgcag 
gaatgtgtaa 
tgtatcatca 
ggaatgatat 



cactctgcac 
gtccttcagc 
gtataaatgc 
gagaacacac 
ttcctccaca 
tcaatgcggg 
tggaaaattt 
ccagtacttt 
ttc 



aggagaattc 60 
gcccattctt 120 
aaggaatgtg 180 
agtggagaga 240 
ctttgtcaac 300 
aagtcctt'ta 360 
agcaaataac 420 
gtactttgag 480 
523 



<210> 68 

<211> 561 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI:796010.1:2000FEB01 



<220> 

<221> unsure 

<222> 92, 176, 216, 233 

<223> a, t, c, g, or other 



<400> 68 

ctcattagaa 

atccttcagt 

tgaatgtcat 

acacagggga 

acctcattaa 

ggaagtcatt 

tcttatgcat 

ctgagaagac 

cagaagttgg 

aatgaatgtg 



atcaaagaat 
gaaaagtcat 
gaatgtggga 
gaagccctat 
acatcagaga 
ctccgagaag 
gtattcaatg 
acacagggga 
ccctaattgt 
gaaaatcatt 



tcacacgggg 
cccttactca 
aaacctcgtt 
gaatgctatg 
attcacacag 
tcaaccctta 
tggaaaattt 
gaaacctttt 
tcaccagaga 



agagacctta 
anatcagaga 
taagtcagtt 
catgtngcaa 
gagaaaaacc 
ctaaacatct 
ttctgctgct 
ggatgtaatg 
actcatataa 



cagatgtcat 
acacacgtgg 
ctaactgtgc 
cacctttctc 
ttatgaatgt 
aagaacacac 
actacagttt 
aatgtgggaa 
gacagaaacc 



gaatggagaa 60 
ggagaaatca 120 
atcagnaaac 180 
agnaaatccg 240 
aatgaatgtg 3 00 
agatgggaaa 360 
cacagaacat 420 
aaccttccat 480 
ctatggatgt 540 
561 



<210> 69 

<211> 543 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 

<220> 

<221> unsure 
<222> 51, 54, 458 



LI:796324.1:2000PEB01 
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<223> a, t # c, g, or other 
<400> 69 

tctataagaa ttcagccctc attagaaatc aaagattcac acaggggaga nacnttacag 60 
atgtcatgaa tggagaaatc cttcagtgaa aagtcatccc ttactcaaaa tcagagaaca 120 
cacgtgggga gaaatcatga atgtcatgaa tgtgggaaaa cctcgtttaa gtcagttcta 180 
actgtgcatc agaaaacaca caggggagaa gccctatgaa tgctatgcat gtggcaacac 240 
ctttctcaga aaatccgacc tcattaaaca tcagagaatt cacacaggag aaaaacctta 3 00 
tgaatgtaat gaatgtggga agtcattctc cgagaagtca acccttacta aacatctaag 3 60 
aacacacaga tgggaaatct tatgcatgta ttcaatgtgg aaaatttttc tgctgctact 420 
acagtttcac agaacatctg agaagacaca cagggganaa accttttgga tgtaatgaat 480 
gtgggaaaac cttccatcag aagttggccc taattgttca ccagagaact catataagac 540 
aga 543 

<210> 70 

<2U> 567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte IDjNo: LI:796373 . 1 :2000FEB01 
<400> 70 

cactggagag aaaccctatg tatgcaatga atgtgggaaa gcttcagcca gaagacatgt 60 
ttaatatccc atcagagatt tcacacagga aagacaccct ttgtatgtac tgagtgtgga 120 
aaatcctgct cacacaagtc aggtctcatt aaccaccaga gaattcacac aggagagaaa 180 
ccctatacat gcagtgactg tgggaaagct ttcagagata aatcatgtct caacagacat 240 
cggagaactc atacagggga gagaccgtat ggatgctctg attgtgggaa agctttctcc 300 
cacttgtcat gccttgttta tcataaggga atgctgcatg caagagagaa atgtgtaggt 360 
tcagtcaaat tggaaaatcc ttgctcagag agtcatagct tatcacatac acgtgatctc 420 
atacaggata aagactctgt taacatggtg actctgcaga tgccttctgt ggcagctcag 480 
acctcattaa ctaacagtgc gttccaagca gagagcaaag tagccattgt gagccagcct 540 
gttgccagaa gttcagtctc agcagat 567 

<210> 71 

<211> 633 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:796415. 1:2000FEB01 
<400> 71 

ctctgctctg ccaacactag tggaatatga tcacatccca gggatcagtg tcatttaggg 60 

atgtgactat gggcttcact caagaggagt ggcatcatct ggaccctgct cagaggaccc 120 

tgtagaggaa tgtgatgctg gagaactaca gccaccttgt ctcagtaggg tattgcattc 180 

ctaaaccaga agtgatcctc aaattggaga caggcaagga gccatggata ttagaggaaa 240 

aatttcgaag ccagagtcat ctgggtgagt tagtgccaga tggaatttaa agaattaatt 300 

aataccagta gaaactattc aagaatgaag ttcaatgagt ttaacaaagg tggaaaatgt 360 

ttctgtgatg aaagcatgaa ataattcatt ttgaagagga accttctgaa tataataaca 420 

atgggaacag cttctggctg aatgaagacc tcatttggca tcagaagatt aaaaattggg 480 

agcaaccttt tgaatacaat gaatgtggga aagctttccc tgagaattca ctcttccttg 540 

tacataagag agcttacaca ggacagaaaa catgcaaata tactgaacat gggaaaacct 600 

gttatatgtc attttttatt actcatcagc aaa 633 

<210> 72 

<211> 652 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 798636 . 1 :2000FEB01 
<220> 
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<221> unsure 
<222> 76 

<223> a, 't, c, g, or other 
<400> 72 ^ 

taatgaatgt ggtaaagcct taagctccca ctcaacactt attattcatg agcgaattca 60 

tactggagaa aaaccntgta aatgtaaagt atgtggaaaa gccttcagac agagttccgc 120 

tctcattcaa catcagagaa tgcatactgg agaaagaccc tataagtgta acgaatgtga 180 

caaaacattc aggtgtaact catcgcttag taatcaccag agaattcata ctggagagaa 240 
accatatcga tgtttagaat gtgggatgtc ttttggccaa agtgcagctc ttatacaaca 300 

tcagaggatt catacaggag aaaaaccctt taaatgtaat acatgtggaa aaacttttag 360 

acaaagctca tcacttattg cacatcaaag aattcatact ggagagaaac cctatgaatg 420 

taatgcatgt gggaaactct ttagccagag gtcatccctt actaatcatt ataaaattca 480 

cattgaagag gactccttaa aagccgattt gcatgtgtga aagccttaaa ccaaaactca 540 

tcagagaata catgcttgag agtgatttat taaatataat gaatattgag aaaactctta 600 

gttctcatca gatactaagt tttaagaata aactttagct atgtaataac tt 652 

<210> 73 

<211> 860 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: til : 800045 . 1 :2000FEB01 
<220> 

<221> unsure 

<222> 311, 500, 509, 541-543 
<223> a, t, c, g, or other 

<400> 73 

aaaattatgc acactgggga aaagcgctat gaatgtgatg actgtggagg gactttccgg 60 
agcagctcga gccttcgggt ccacaaacgg atccacactg gggagaagcc gtacaagtgt 120 
gaggaatgtg ggaaagccta catgtcctac tccagcctta taaaccacaa aagcacccat 180 
tctggggaga agaactgtaa atgtgatgaa tgtggaaaat ccttcaatta tagctctgtt 240 
ctggaccagc ataaaaggat ccacactggg gagaagccct atgaatgtgg tgagtgtggg 300 
aaggccttca ngaacagctc tgggctcaga gtccacaaaa ggatccacac gggggagaag 3 60 
ccctatgaat gcgacatctg tgggaaaacc ttcagtaaca gctctggcct cacagtgcat 420 
aaaaggatcc acacagtttc agatgaactc ccataatgaa tgatgaattt gtgatgaggg 480 
ataccctgga agtggtattn acacattang ctacaataaa aggttctacc gtggagagga 540 
nnntgacaca ttcagtaact aatggaacac accgtcaaca tgaattcgca ccttacatga 600 
cagaagtgat tcagggattc ctatgaatag aaatgctgag aaggaacgca ttttattgca 660 
gaagctaaaa agctaaagta. ccagtcatct agagagaagg aaattaatgt ttcttaataa 720 
tcctgttaaa tgtttgattg tttttggaat gtgttattgt aaagatgtca tgcaggacat 780 
gtatatgttg tctgttgtaa aatgttaacg aatactttgt tcagggctca ctctctcttt 840 
gtcatgaaag ccagctcctt 860 

<210> 74 

<211> 501 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 800680 . 1 :200CIPEB01 
<400> 74 

gcgctctcgg cccacacaat atgacctcgg ggaggatgcg aggaagatga actgtgatga 60 

tccacttctt cttaatgaat gactgactta cctgagaaag aaactcagag gaagaggaaa 120 

gaaagaagag gagggaatgg ctctttctca gggactgttt acattcaagg atgtggccat 180 

agaattctct caagaggagt gggagtgcct ggaccctgcc cagagggcct tgtacaggga 240 

cgtgatgttg gagaactaca ggaacctgct ttctctcgat gaggataaca tccctccaga 300 

agatggttct caccttgcag cctgtggaca gagcacactg cctcttcctt agaatcctac 3 60 

aaaatccgac ccttttattt tactcccgta ccttttattt ctctcccttt tctagctttc 420 

tactctatcc gggaacttag ggagacctta gtcttcctga ttccatattc ttctaagatt 480 
tgcctctgaa agttgctttt g 501 
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<210> 75 

<211> 703 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 800894 . 1 :2000FEB01 



<400> 75 

gtctcgtcct tactgctgtg tgtcctctgc 
acctgcaggt attgggagat ccacaactaa 
gggatcattg acatttaggg atgtggccat 
ggacactgca cagcagaatt tatatagaaa 
cttcctgggt attgctgcct ttaagccaga 
gtcctggaat atgaagagac atgagatggt 
tgctcaagat ctttggccag agcagggcat 
aagatacaag attcatcatc atgcctgtga 
ctgtggccaa atgcatatat gacagttaca 
aatttaggtc accagtgggc tctctccatg 
ggtgtgcata ggagagtcac aatctcacct 
actaccttaa gactgtatga cacatgggtg 

<210> 76 

<211> 256 

<212> DNA 

<213> Homo sapiens 



tcctagagtc ccagcctctg tggccctatg 60 
gtcgccagga ccccctggaa gcctagaaat 120 
agaattctct ctggaggagt ggcaatgcct 180 
tgtgatgtta gagaactaca gaaacctggt 240 
cctgatcatt tttctggagg aaggaaaaga 300 
ggaagaatcc ccagttatat gttctcattt 360 
agaagattct ttccaaaaag tgatattgag 420 
gctgggtcca ataatgaatc actatcctac 480 
actctaactg cagactgcat cggcatgtaa 540 
tttgaatgtg gtgattctaa tggtcagcga 600 
tcaggtgggc cttgtatcag cactctctct 660 
agctaaaaat gtt 703 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 801015 . 1 : 2000FEB01 
<400> 76 

ggaagccgga aaatggactc agtggccttt gaggatgtgg cagtgaactt cacccaggag 60 
gagtgggctt tgctggatcc ttggcagaaa aaactctaca gagatgtgat gctggaaacc 120 
tataggaacc tggcttcagt aggtgatgac gacaacattc cttcacttag agaacaagtt 180 
gcccatcaac gatatttcaa gacctggcat gtggaaaggg aatacttcag taaataaacc 240 
aagcatggtg acagct 256 

<210> 77 
<211> 458 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 80123 6 . 1 :2000FEB01 
<220> 

<221> unsure 

<222> 11, 25, 58, 61, 91, 428 
<223> a, t, c, g, or other 

<400> 77 

aaagtttttt ngcccacagt acatnattaa tccgacatca gagaacccac acaggagngg 60 
naaccatatg aatgtaatga atgtgggaaa ntttcagcca taccccagcc ttcattcaac 120 
atcaaagaat tcatactgga gaaaaaccct atgagtgtaa tgcatgtgga aaggccttca 180 
atcggagtgc acatcttact gaacaccaga gaactcatac tggagagaag ccctatgttt 240 
gtaaagaatg tggaaaaacc ttcagtcgaa gtacacacct tactgaacat ctaaaaattc 3 00 
attcttgtgt gaaaccctat caatgtaatg aatgtcagaa actgttttgc tatagaacat 3 60 
cactaattcg acatcagagg acgcatacag gagagaaacc ctaccagtgt aatgaatgtg 420 
ggaaatcntt cagcttaagc tcagctctaa ctaaacat 458 

<210> 78 

<211> 604 

<212> DNA 

<213> Homo sapiens 



36/228 



WO 01/62927 



PCT/US01/06059 



<220> 

<221> mis cofeature 

<223> Incyte ID NO: LI : 803335. 1 :2000FEB01 



<400> 78 

gggccagcct ggggcttggt tctatgtccc tgcgggtcgg tgcgagggcg aagaggaacc 60 
cgtgggcccg ggggatcccg gggggccgga ccagtgttcc ccagttgtgg gagcagacgc 120 
gtgggcgcat cgcgggcggg cagggcctga agtgcagcta tgtttccagt gttctctggc 180 
tgtttccaag agctacaaga aaagaataaa tctctggagt tggtgtcctt tgaggaggta 240 
gctgtgcact tcacctggga ggagtggcag gacctggatg acgctcagag gaccctgtac 300 
agggacgtga tgctggagac ctacagcagc ctggtatcat tggggcattg cattaccaaa 3 60 
cc tgagatga tcttcaagct agagcaagga gcagagccat ggatagtaga agaaacccta 420 
aacctgagac tttcaggcta agcctccctc cttttggtag aggctttttt ataccatttg 480 
ggaactggtt tggataagct accttaatga aggacgacac atccctccag agatgataaa 540 
agactttttg tcttttctga tttaagaaaa ctttatctcg taagctacct atagtttaca 600 
aaaa 604 

<210> 79 

<211> 641 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID NO: LI : 803 998 . 1 : 2000FEB01 
<400> 79 

aggcttcaac cagtcctcaa accttatgga acataagaga attcatactg gtgagaagcc 60 
ctacaaatgt gaagaatgtg ataaagactt caactggtcc tcacacctta ctactcataa 120 
aagaattcat actggaggaa aaaccctaca aatatgaaaa atgtggcaaa actattaatt 180 
aattctcaac gcttactgaa cataagggaa tttgtacagg agggaaaccc tacaaatctg 240 
aagaatgtgg caagcctttt agctattcct caatccttac taaacattca tgtaattcat 300 
actggaggga attcctacaa ttgtgtggaa tgttgcaatg cccttaacca gtccttaagg 360 
cttactacat ataagacaac tcatactgga gagaaaccat gcatgtgtga agaatgtggg 420 
aaagcctcta atagatcctc aattcttaag agacataagc taattcatac acaagagaga 480 
ctctacaaac ctgaaagatg tgacaatgct tttggcaaca cctcagactt ttctgaatat 540 
aaaagaaatc gtacagatga gaaatcctag aaaagtgaag aatatgacaa agcccttaag 600 
tggctgtcac acttgattgt aggtaagata actcatactg g 641 

<210> 80 

<211> 991 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 47 8757 . 1 :2000FEB01 
<400> 80 

gtgagaaggg aaaggagggg tgagggtgca ggggccctcc taggaggaca tcggacctgt 60 
tgggccgaga cctcattaga gatctagagc accggatggg aggagcagag gctcgatttg 120 
tgtgctgtcc agactcccac aggctgggtg atggtgggag atttgtgccc attcagatgt 180 
ggcggcagag gagggcaagg cccgaaggag cagacagcac cgcttcttgg ggacttgtga 240 
aggcatcatg cggagggccg agcttagcag ccaagtggag gacagcaccc tccatgcctg 300 
gattcgttac tcgctcgttc tcgatgttga ctgctggcat attgcagcac aactagagat 360 
gtacggatgc ccccatcttg atcttacaga atccagaggt gcagccgcaa gaaagctaca 420 
ccttctcggt ttctctgctc tgccaacact agtggatatg atcacatccc agggatcagt 480 
gtcatttagg gatgtgacta tgggcttcac tcaagaggag tggcatcatc tggaccctgc 540 
tcagaggacc ctgtagagga atgtgatgct ggagaactac agccaccttg tctcagtagg 600 
gtattgcatc attcacaggg tattgcattc ctaaaccaga agtgatcctc aaattggaga 660 
caggcaagga gccatggata ttagaggaaa aatttcgaag ccagagtcat ctggaattaa 720 
ttaataccag tagaaactat tcaagaatga agttcaatga gtttaacaaa ggtggaaaat 780 
gtttctgtga tgaaagcatg aaataattca ttttgaagag gaaccttctg aatataataa 840 
caatgggaac agcttctggc tgaatgaaga cctcatttgg catcagaaga ttaaaaattg 900 
ggagcaacct tttgaataca gtgaatgtgg gaaagctttc cctgagaatt cactcttcct 960 
tgtacataag agagcttaca caggacaaaa a 991 
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<210> 81 

<211> 680 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc — feature 

<223> Incyte ID No: LI: 808532. 1:2000PEB01 
<400> 81 

gcacaatttc cagcttcaga aacatcacag aattcatact ggggagaagc cattcaaatg 60 

tgaaatatgt ggtaagagct tctgtcttag gtcaagtctt aataggcatt gcatggtcca 120 

cacagcagag aaactgtaca aatctgaaaa gtatggaaga ggtttcattg ataggctaga 180 

tttgcataag catcagatga ttcatatggg acagaaacca tataattgta aagaatgtgg 240 

gaagagcttc aaatggtcct catatctttt ggtccatcaa cgagtccaca ctggagaaaa 300 

gccatacaaa tgtgaggagt gtgggaaggg ctacattagt aagtcaggtc ttgacttcca 360 

ccatagaacc cacacgggag agagatctta taactgtgat aactgcggga agagctttag 420 

acatgcttct agtattttga atcataagaa actccactgc caaagaaagc cattgaaatg 480 

tgaagactgt ggaaagaggc ttgtatgccg gtcatactgt aaagaccaac aaagagacca 540 

cagtggagaa aacccatcca aatgtgagga ctgtgggaag cgctacaaga ggcgcttgaa 600 

tctggatata attttatcat tatttttaaa tgacatataa gttatacata tttatggagt 660 

gtgaaatttg atacatgtat 680 

<210> 82 

<211> 420 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 443073 . 1 :2000FEB01 
<400> 82 

aaaaccctat atgtgtaagg aatgtaggaa aacatttagc cagaatgcag gccttgctca 60 
acatcagaga attcatactg gagagaaacc ttatgaatgt aatgtttgtg ggaaagcatt 120 
tagctatagt ggatctctta ctctacatca gagaattcat actggagaaa gaccctatga 180 
atgtaaagat tgcaggaaat ctttcaggca gcgtgcacat cttgctcatc atgagagaat 240 
tcatactatg gagtcattct tgactctttc ctctccctca ccctccacat caaatcagtt 300 
gccaagacct gtaggtttca tctcctgaat atttctggaa tccacctctt gaatccattt 360 
ccatcccatc atccttgtcc aatgcacatt aatatatttg acatgggata ctcgagtagc 420 

<210> 83 

<211> 659 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: IiI:479671.1:2000FEB01 
<400> 83 

cgtgcccgca cgtgcgtgtc tcggtcagta gccctgcgct tctccttcac tctcggcggt 60 

tcaggaggct ctgccgcagc cggggccctc ctgtgacctg catgtactgg gggattcgca 120 

gggaggatgt cgggacaccc cggaagctgg gaaatgaact cggttgcctt tgaggatgtg 180 

gctgtgaact tcacccagga ggagtgggct ttgttggatc cttcccagaa gaatctctat 240 

agagacgtga tgcaggaaac cttcaggaat ctggcttcca taggaaacaa aggggaagac 300 

cagagcattg aagatcagta caaaaattct tcaagaaatc taagatgagg tttatataac 360 

tttccaagac aaggaagttg tgcaaagcct cacagctact aagaggctgg gctgggactc 420 

aggattaact tcgtgactcc aaagcccatg cacctctgca agccacacga atcaagcagc 480 

ttctctaaac ctttgaagtc ctggaaagta actcttctgc aatagaagca agaatcctgt 540 

gagattggga tccatctagt actcttcacc atttggaaca actgcctcca gagaatgctt 600 

ctgacgcatc ttggaagaac ttggaataac ctgaggaaat ggtcgatttg ctaatttag 659 

<210> 84 

<211> 772 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 810078 . 1 :2000FEB01 
<400> 84 

ccctatgtat gtaaggaatg tgggaaagcc ttcactcagt actcgggcct tagtatgcat 60 
gtacgatctc acagtggaga caagccctat gaatgtaagg aatgtgggaa atccttcctt 120 
acatcctcac gccttattca acatataaga actcacactg gagagaagcc ttttgtatgt 180 
gttgaatgtg ggaaagcctt tgcagtttcc tcaaatctta gtggacattt gagaactcac 240 
actgaagaga aggcctgtga gtgtaagata tgtgggaaag tatttgggta tccctcatgt 300 
cttaataatc acatgcgaac gcacagtgcc cagaaaccat acacctgtaa ggaatgtggg 360 
aaggctttta actattccac ccaccttaaa attcacatgc gaatccacac tggagaaaaa 420 
ccctatgagt gtaaacaatg tggaaaggcc ttcagtcatt ccagttcatt tcaaatacat 480 
gaaaggactc acactggaga gaaaccctat gaatgcaagg agtgtgggaa agccttcacg 540 
tgaagagagt tcctttagaa ttcatgaaaa aactcacaca gaagagaaac cctataaatg 600 
tcagcaatgc gggaaagctt acagtcatcc ccgttcactt cgaagacatg aacaaattca 660 
ctagtgagaa actgtccatg taataaatgt gggaaagctc tcatttgttc cagttcactt 720 
taaagacatg aatgaactca ctctggagag aagaagctgc atgaaaatta ct 772 

<210> 85 

<211> 441 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 810224 . 1 :2000FEB01 
<400> 85 

ggggagaaat catgaatgtc atgaatgtgg gaaaccctcg tttaagtcag ttctaactgt 60 
gcatcagcaa acacccaggg gagaagccct atgaatgcta tgcatgtggc aacacctttc 120 
tcagaaaatc cgacctcatt aaacatcaga gaattcacac aggagaaaaa ccttatgaat 180 
gtaatgaatg tgggaagtca ttctccgaga agtcaaccct tactaaacat ctaagaacac 240 
acagatggga aatcttatgc atgtattcaa tgtggaaaat ttttctgctg ctactacagt 300 
ttcacagaac atctgagaag ccacacaggg gagaaacctt ttggatgtaa tgaatgtggg 360 
aaaaccttcc atcagaagtt gccctaattg ttcaccaaga gaactcaata taatgacaga 420 
cacgcactat agatgtactg a 441 

<210> 86 

<211> 1072 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI: 817052 .2 :2000FEB01 
<400> 86 

aattccccgg gtcgacccac gcgtccgggt ggaagagcac ggcttccagg actcagagct 60 
gggtgacccc tgtggggaac agccagacct ggacatgcag gagccagaga acacgctgga 120 
ggagtccacg gaaggctcca gcgagttcag cgaactgaag cagatgctgg tgcagcagag 180 
gaactgcacg gaggggatcg tgatcaagac agaggaacaa gacgaccact tacaccacga 240 
ggaggaggat gagctgccgc agcacttgca atcccttggg cagctgtccg ggagatatga 300 
ggccagtatg taccagaccc cgctgcccgg ggagatgtcc cccgagggcg aggagagccc 360 
cccgcccctg cagctaggaa accccgcagt gaaaaggctg gcgccctccg tgcacggtga 420 
gcggcacctg agagccagca gcagcggaac cggcgcggcg agcggccctt cacatgcatg 480 
gagtgcggca agagcttccg cctgaagatc aacctcatca * tccaccagcg caaccacatc 540 
aaggaggggc cc tacgagtg cgccgaatgc gaga t cage t tccggcacaa gcaacagctc 600 
acgctgcacc agcgcatcca ccgcgtgcgc ggaggctgcg tctcacccga acgcgggccc 660 
acgttcaacc ccaagcacgc gctcaagccg cgtcccaagt cacccagctc tggtagegge 720 
ggcggtggcc ctaagcccta caagtgcccc gagtgegaca gcagcttcag ccacaagtcc 780 
agcctgacca aacaccagat cacgcacacg ggtgagcggc cctacacgtg ccccgagtgc 840 
aagaagagct tccgcctgca catcagcttg gtgatccatc agegegtgea cgcgggcaag 900 
catgaggtct ccttcatctg cagcctgtgc ggcaagagct tcagccgccc ctcgcacctg 960 
ctgcgccacc ageggactea cacaggegag cggcccttca agtgccccga gtgcgagaag 1020 
agettcageg agaagtccaa gctcaccaac cactgccgcg tgcactcgcg eg 1072 
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<210> 87 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 892274 . 1 : 2000MAY19 
<400> 87 

ctgtcgggaa atgagatggg 
gtccagacct taaccctgtt 
caggaacacc aggaagtgca 
ggggagacaa acccatcaaa 
tgggactaat tttaggcaaa 
ctgttgactc tgattatgaa 
tttttgaact gggagaatat 
ctccatgaaa ggagaaacga 
atctatcccg acccatagcc 
aattttatgg gcttatggat 
ccacatcctg gctcttgcaa 
gtgttcaaca gagtgctgag 
ttcagcctta tgttgcaaat 

<210> 88 
<211> 570 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1080959 . 1 : 2000MAY19 
<400> 88 

aaaaaaacaa gttgctttat 
accaaagcag ggaataaatg 
gtctaggccc ccaatgctgt 
ggacccagtg gcttttaagg 
ggatatttcc cagaagaatc 
ctctatagga aaaaagtgga 
aaacttcagg agtgtcacag 
agaaactttt accccagttc 
tgaagtaaaa tcatgtgaca 
tatgaacatc agaggtgaca 

<210> 89 
<211> 408 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LG: 1054900 . 1 : 2000MAY19 

<400> 89 

cggacgcgtg ggctaggccc ccagtgctgt cactctcacc catcctcctc tacacatgtg 60 
agatgtttca ggacccagtg gcctttgatg atgttgctgt gaacttcacc caggaggagt 120 
gggctttgct ggatatttcc cagaggaaac tctacaagga agtgatgctg gaaactttca 180 
ggaacctgac ctctgtagga aaaagttgga aagaccagaa cattgaatat gagtaccaaa 240 
accccaggag aaacttcagg agtctcatag aaaagaaagt caatgaaatt aaagatgaca 300 
gtcattgtgg agaaactttt acccaggttc cagatgacag gctgaacttc caggagaaga 360 
aagcttctcc tgaaataaaa tcatgtgaca gctttgtgtg tggaaaac 408 

<210> 90 

<211> 532 

<212> DNA 

<213> Homo sapiens 



ggcctggact tgggtcccac 
ttaggcatgg gttcctggag 
ggattagatc tacctgccag 
gttcccactg gtatttgggg 
agccgcctta acttgcaagg 
ggagaaactc aagtagtttt 
attgctcaat tattgcttat 
ggaaataaag ggtttgggag 
tctaatagac ccacctgtgt 
acgggaactg atgtatcagt 
ttaacttcta catccctagt 
attttacctt gtcttggtcc 
atagctatca gtttatggg 



aaacagtgag ggcattccca 60 
gcacattgat tccatcaccc 120 
agaaagaatc acattagttg 180 
aacttcacca gcaggataca 240 
catgactgta gtcccaggag 300 
aatgtcacaa gatctttggg 360 
tccctgcaaa ttacaccctt 420 
cacaactaca tgggaaatct 480 
agtacaaatt aaaggaaaga 540 
aatatctaaa gacaattggc 600 
ggaagtagga acagctcaaa 660 
ggatgggcag tcatgtactt 720 

759 



ggaagaaagt aagtatagac 
tttggagtcc acggcatcct 
cactctcacc catcctcctc 
atgtggctgt gaacttcacc 
tctacaggga agtgatgctg 
aagaccagaa cattgaatat 
aagagaaagt caatgaaatt 
cagatgacag gctgaacttc 
gctttgtgtg tgaagttggc 
ctggacacaa 



agagagaaag ggatctgatg 60 
gagaacttct tgggaataga 120 
tacacatgtg agatgtttca 180 
caggaggagt gggctttgct 240 
gaaactttct ggaacctgac 300 
gagtaccaaa accccaggag 360 
aaagaagaca gtcattgtgg 420 
cagaagaaga aagcttctcc 480 
ctaggtaact catcttctaa 540 

570 
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<220> 

<221> raisc_feature 

<223> Incyte ID ,No: LG : 1077357 . 1 : 2000MAY19 
<400> 90 

cattcatgga ttgacttata aacagtcatg ctatgtgatg aagaagccca gaagaggaaa 60 
gcaaaggagt cagggatggc tcttcctcag ggacgcttga cattcatgga cgtggccatc 120 
gaattctctc aggaggagtg gaaatccctg gaccctggac agagggcttt atacagggac 180 
gtgatgttgg agaactacag gaacctggtc tttctgggaa tctgtcttcc tgacctaagt 240 
attatttcca tgttgaagca aaggagagag cccttgattc tgcaaagtca agttaaaata 300 
gtaaaaaata cagatggaag ggaatgtgtc agaagcgtga acacagggag gagctgtgta 360 
ttgggaagca atgcagaaaa caagcctatt aaaaatcaac ttggattaac ccttgagtca 420 
catctgtctg aattgcagct gtttcaagcc ggaaggaaaa tttacagaag taatccagtt 480 
gaaaagttta caaaccatcg ttcctcagtt tcaccacttc aaaaaatttc tt 532 

<210> 91 

<211> 546 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1084051 . 1 :2000MAY19 
<400> 91 

caacaagtta cataagaaca aaaacctatg aatgtaatat atgtgaaaaa atcttcaaac 60 

aacctattca ccttactgaa catatgagaa ttcatactgg tgagaaacct ttcagatgta 120 

aggaatgtgg aagggccttt agtcaaagtg catccctcag tacacaccag agaatccata 180 

ctggtgagaa accctttgaa tgtgaggaat gtgggaaagc cttcagacat cgctcatcac 240 

ttaatcagca tcatagaact cacactgggg agaaacccta tgtatgtgat aaatgtcaga 300 

aagctttcag ccagaacatt agcttggttc aacatttgag gactcattct ggagagaaac 360 

cttttacttg caatgaatgt gggaaaacct ttagacagat tagacacctt agtgaacata 420 

taagaattca taccg-gggag aagccctatg catgcactgc atgttgtaaa acctttagtc 480 

atagagcgta tctaacacat caccagagat ccatactggg gagagaccta caatgtaaag 540 

aatgtg 546 

<210> 92 
<211> 1386 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076853 . 1 :2000MAY19 
<400> 92 

gcctttagtc gatgttcttc ccttgtccaa catgagagga ctcatactgg agagaaacct 60 
tttgaatgta gcatatgtgg gagggctttt ggtcagagcc catcccttta taaacatatg 120 
aggattcata agagaggcaa accttaccaa agcagtaact acagcataga tttcaagcac 180 
agcacatctc tcactcagga tgaaagcact cttaccgaag tgaaatccta ccattgtaat 240 
gactgtgggg aagactttag tcacattaca gactttactg accatcagag gatccatact 300 
gcagagaacc cctatgattg tgagcaggct tttagtcagc aagctatttc tcatcctgga 360 
gagaaaccct atcaatgtaa tgtatgtggg aaagctttca aaaggagtac aagtttcata 420 
gagcatcaca gaattcatac tggagagaaa ccctatgaat gtaatgagtg tggagaagcc 480 
tttagtcgac gctcatcgct tactcaacat gagagaaccc acactggaga gaaaccctat 540 
gaatgtattg actgtgggaa agcctttagt caaagttcat ctctcattca gcatgagaga 600 
actcatactg gagagaagcc ctatgaatgt aatgaatgtg ggagagcctt ccgaaaaaaa 660 
accaacctgc atgatcatca gagaattcat actggagaaa aaccctattc ttgtaaggaa 720 
tgtgggaaaa acttcagccg aagttcagct cttactaaac accagagaat tcatactcga 780 
aataaactct aggaaccgtg aaattaagga atttgcagaa tgctttagct aaaatgttct 840 
gattcaggat cagaggattc ttagagagct tgggaatgta atgaattacg tgtgtgttta 900 
tacgttgtgt gtggagaaaa ctgccagtag acagattttt ttttttttaa cataaagaca 960 
cacattctca gatctgatta cagactagtg taaaaacagc tacatgtatg tagctggttg 1020 
gggatgatat gcctgtatgt tggacttggc tttggaatat atgtatgcag gatatcatca 1080 
agtttcaaca tcttgacttg tgacccccaa tgtcaacagc ttttttaaaa agcaaattcc 1140 
tgcagtaatg accaaaaccc attttaaaaa ttgcttgaca actgcactca actggcagct 1200 
cttacattaa cttcaccatg gaaaccagtt ccaactccag gaagtcacca ttcaaagaat 1260 
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tagatcaatt agcccaacca cttcattggt ccagtatgaa gactggaaag ccaaagatgt 1320 
gacgtggttc ccacagtatg atacagccta taagggtaaa gctgggttaa aaatgcggtt 1380 
cctgga 1386 

<210> 93 

<211> 1039 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 481631 . 10 -.2000MAY19 
<400> 93 

acaggattga cttctaaaga ctcttggtac ctgaggaaga aacccagaag aggaagagga 60 
aagcaaagga gtcggggatg gctctttctc agggtctatt gacattcagg gacgtggcca 120 
tagaattctc tcaggaggag tggaaatgcc tggaccctgc tcagaggact ctatacagag 180 
acgtgatgct ggagaattat aggaacctgg tctccctgga tatctcttcc aaatgcacga 240 
tgaaggagtt cttgtcaaca gcgcaaggca acagagaagt gttccatgca gggacattgc 300 
aaatacatga aagtcatcac aatggagatt tttgctacca ggatgttgat aaagatattc 3 60 
atgactatga atttcaatgg caagaagatg aaagaaatgg ccatgaagca cccatgacaa 420 
aaatcaaaaa gttgacaggt attacagaac gatatgatca aagtcatgct agaaacaagc 480 
ctattaaaga tcagcttgga tcaagctttc attcgcatct gcctgaaatg cacatatttc 540 
agaccgaaga gaaaattgat aatcaagttg tgaagtctat ccacgatgct tccttggttt 600 
caacagccca aagaatttct tgtaggccca aaacccatat atctaataac catgggaata 660 
atttctggaa ttcttcatta ctcacacaaa aacaggaagt acacatgaga gaaaaatctt 720 
tccaatgtaa tgagagtggc aaagccttta attacagctc actcttaagg aaacatcaga 780 
taatccattt agcagacaaa tataaatgtg atgtatgtgg caagctcttt aatcagaagc 840 
gaaacctagc atgccatcgt agatgtcaca ctggtgagaa tccttacaag tgtaatgagt 900 
gtggcaagac cttcagtcag acgtcatccc ttacatgcca tcgtagactt catactggag 960 
agaaacctta caaatgtgaa gaatgtgaca aagctttcca tttcaaatca atacttgaaa 1020 
gacataggat aattcatac 1039 

<210> 94 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID^No: LG : 1088431 . 2 : 2000MAY19 
<400> 94 

cggatctcgc ggggctactg gcgctctcgg cccacacaat atgacctcgg ggaggatgcg 
aggaagatga actgtgatga tccacttctt cttaatgaat gactgactta cctgagaaag 
aaactcagag gaagaggaaa gaaagaagag gagggaatgg ctctttctca gggactgttt 
acattcaagg atgtggccat agaattctct caagaggagt gggagtgcct ggaccctgcc 
cagagggcct tgtacaggga cgtgatgttg gagaactaca ggaacctgct ttctctcgat 
gaggataaca tccctccaga agatggttct caccttgcag cctgtggaca gagcacactg 
cctcttcctt agaatcctac aaaatccgac ccttttattt tactcccgta ccttttattt 
ctctcccttt tctagctttc tactctatcc gggaacttag ggagacctta gtcttcctga 
ttccatattc ttctaagatt tgcctctgaa agttgctttt g 

<210> 95 
<211> 1243 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 401619 . 10 :2000MAY01 
<400> 95 

gctcttcatc ctttaccaaa gagacaagca cttgaaaaaa gcaatggtac cagcgcggtc 60 
tttaacccca gcgtcttgca ctaccagcag gctctcacca gcgcacagtt gcagcaacac 120 
gccgcgttca ttccaacagg tatgtgccct tactgcccta cgtcctgtgc ccttctggtc 180 
atgtgctttc ttctcatttc tctaagctgt ttggtggcat ctagtttgct tttgaaggta 240 



60 

120 

180 

240 

300 

360 

420 

480 

521 
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taatacagtt tgaaattcat cgttgtccta gctatctaaa tgtatttacc ttactttgaa 300 
tgatagctaa agactgttag gattctaaag ccaaatattt gatagattga agagacagat 360 
ttaacccatg agaaacagca gttagggctt ttggtttctt gtatttgcac aagccctgta 420 
aaattgttta tgtaaataag accttttatg tgtgacaatt gaaatttgtc cttaactctg 480 
aatgacctaa aaatagcaat tccagtaaat actaaccatt tttttctatt tctattcaga 540 
gcactaaaac aatgaggcta ttcaaattaa agcaattctc tactcatatt tttatattca 600 
ttctatctct ttctccatcc ttctcaactt tcaccaagtt cacaagtata tagagctctt 660 
atcctcagtg tctaagccaa tgcctgatac tattacgtac gatgtgcatt aactatgatt 720 
ccactaaaag atccattgta atagtcatag aatcttagag tttaaaggac tcttagtgat 780 
ctcctcatcc agctgattgt tttacagatg agaaaactga ggccccctaa atgagaagtg 840 
actttccaag gtgccacaac taatgagaaa aagaactgag tttccctgtg accaaaccca 900 
tttacatcac attctaccac ctgggcccgc ctatatatac acattccaca gagttctcct 960 
gaaaaaagaa aaaagcagat aaaagtgaat ttttaaataa ctgaccccaa aaagtcagat 1020 
aaaagtaaaa aaacaaaagt ataaatcatg tcatccctcc cccatttgca ccgacatctc 1080 
taaccacaga cacacacacg cacaccatac gcaaagatag tcaccataat tgaccatgtt 1140 
tttcaccttt tagtcaatgt tagaagcaag gggtaactta agtgctgagt tgggagagcc 1200 
tccattgagt tctttgagag tcaacatttt cagcgccggt agt 1243 

<210> 96 

<211> 4338 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 1144007 .1:2000MAY01 
<220> 

<221> unsure 

<222> 2673, 3345, 4031 

<223> a, t, c, g f or other 

<400> 96 

cgtgttccga cccgctaggc cccgcgcggc tcggaatccg gcggcgctgt ttcggtcggg 60 
agtgggtggg agagaagccg gggcagggga ggagccgccg gagctgtcgg agccggccct 120 
tggaagaaaa tcctcgctgt gtccaggcbg aggcgggggg ctaatgacag tgtgagctct 180 
agatggtgtg agaccacccc aaagccaaga aatggctaca gccgtggaac cagaggacca 240 
ggatctttgg gaagaagagg gaattctgat ggtgaaactg gaagatgatt tcacctgtcg 300 
gccagagtct gtcttacaga gggatgaccc ggtgctggaa acctcccacc agaacttccg 360 
acgcttccgc taccaggagg cagcaagccc tagagaagct ctcatcagac tccgagaact 420 
ttgtcaccag tggctgagac cagagaggcg gacaaaggag cagatcctag agctgcttgt 480 
gctggaacaa tttcttaccg tcctacctgg agaactacag agctgggtgc ggggccaacg 540 
gccagaaagt ggcgaggagg cagtgacgct ggtggagggt ttgcagaaac aacccaggag 600 
accaaggcgg tgggtgactg tccatgttca cggccaggaa gtcctgtcag aggagacggt 660 
gcatttagga gcggagcctg agtcacctaa tgagctgcag gatcctgtgc aaagctcgac 720 
ccccgagcag tctcctgagg aaaccacaca gagcccagat ctgggggcac cggcagagca 780 
gcgtccacac caggaagagg agctccagac cctgcaggag agcgaggtcc cagtgcccga 840 
ggacccagac cttcctgcag agaggagctc tggagactca gagatggttg ctcttcttac 900 
tgctctgtca cagggactgg taacgttcaa ggatgtggcc gtatgctttt cccaggacca 960 
gtggagtgat ctggacccaa cacagaaaga gttctatgga gaatatgtct tggaagaaag 1020 
actgtggaat tgttgtctct ctgtccattc ccaatccccc agacctgatg agattctccc 1080 
agggttagac gaggaagagc ctggggtccc agatatccaa gagcctcagg agactcaaga 1140 
gccagaaatc ctgagtttta cctacacagg agataggagt aaagatgagg aagagtgtct 1200 
ggagcaggaa gatctgagtt tggaggatat acacaggcct gttttgggag aaccagaaat 1260 
tcaccagact ccagattggg aaatagtctt tgaggacaat ccaggtagac ttaatgaaag 1320 
aagatttggt tactgaatat ttcttcaagt gaatagtttt gtggaacctt cggggaaact 1380 
acacccagtc caccccctgt tagggaggca tcatgactgt tctgtgtgtg gaaagagctt 1440 
cacttgtaac tcccaccttg ttagacacct gaggactcac acaggaggag gaaaccctat 1500 
aaatgtatgg gaatgtggga aaaagttaca cacggaggct cacatcttgc cagggcacca 1560 
aaaggttcac aaggatgaac gcgccttaca aatatcccct aaaccggaag aattttggaa 1620 
gagacctccc ctgtgacaca ggctgagaga actccatcag tggagaaacc ctatagatgt 1680 
gatgattgcg ggaaa'gcact tccgctggac ttcagacctt gtcagacatc agaggacaca 1740 
tactggagaa aaacccttct tttgtactat ttgtggcaaa agcttcagcc agaaatctgt 1800 
gttaacagac acaccacaag aatccacctg ggaggcaaac cctacttgtg tggagagtgt 1860 
ggtgaggact tcagtgaaca caggcggtac ctggcgcacc ggaagacgca cgctgctgta 1920 
ggaactctac ctctgcagcg agtgcgggcg ctgcttcacc cacagcgcac ggatcgccaa 1980 
gcacttgaga cgcaacgcct cagtgaggcc ctgccgatgc aacgaatgtg gggaagagct 2040 
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tcagtcgcag ggaccacctc gtcaggcatc agagaacaca cactggggag aaaaccaatt 2100 
ttttcccaac gtgccctacc tgtggaaaaa gcttcagcag aggatatcac ttaattaggc 2160 
atcagaggac cccactcaga aaagacctcc tagctaggtc cccatgtgag gagatctgct 2220 
ttcagccctc acctaaggga ggtgaggaag aggaaaagcc ctcttgtcag cctgggaaga 2280 
ccttttcgag ggagtctccc tgacctgctc agate tgaca ttacctcttc ctgcaactaa 2340 
acacgagcct gggcagaacc tctcagcctt cctctacgcc ttgaggggca tgtttcatcc 2400 
aaagtacaac ctgaattgag gcttctcctt cacteggagt gcacctgcct ctaccatcat 2460 
gggtataaac gtaggagaac taagagactt aagagegteg tggttcctat ategtccaaa 2520 
aaataggctg ttacatatcc taaagactgc tcaacaagct tcaagttgaa aagtggccaa 2580 
aggacagccc ctttaaggtt tgggaagggg aegaagectg aagggattct gtctttactg 2640 
ggggtcaaat cttaaagcac acagttctgg ganttcaaga caggaeggtt tgcgtcctga 2700 
tgggctttgc cacacattca cagggataac ttgeatagat tccctcgctg tcttgattca 2760 
cttcttacca tgcactttcc tttgatgctg aggagaaatg gaagtgggcg aaaaatctca 2820 
aggctgette atgtggacct tggtcaagct ggctccctcc cccagcgtca aattgttatc 2880 
aggtgccaaa cactgetaga aagggggggc ctaggtcaga agcctctttc catacgagtt 2940 
ttggttttgt ttttaatatt tttttcctat taaaatactt catgeattta accttcccgt 3000 
tattcaacca gtctcttggg ttgcatccct agcacttctc actacaagtg agatggtagt 3060 
gtttgagtgc ttattgagta aagcataatt eggtcataat gaaatcgttc acattccctc 3120 
atatgeacaa gcccaccaac cccttcacac cccccttcac aggggtegta tgagtaaggg 3180 
gatttggaaa ctgtcaacct tacaaaggca ctataacaat tacagaatca tgattgecat 3240 
gggccacttt atttacatga agacaactgg gagaacgact aagaccaaat tatggaaaat 3300 
aagaaaaagc tgttgctggg caagaccatc aagactgttc tgacncaccc tgtccccatc 3360 
atcccgtgac tgagtactgt gaeatcaegg aaagtgttga acctgggaac ctgtagagga 3420 
ttctcccaca ggattaattg cgcttcaatg tatttgtttt gtttgctttt tcttacgtga 3480 
ttttatgttc atagagctag aaagtagcat ctcatgatgg cccaacaatc tctgttgcca 3540 
gttaaaggtt ccttggagat gaggctgaat aattatgaac ctcacccttc tctgattgtg 3600 
ggagtggcaa gaactgggga gaccgtcctc tccatagtgg agcacagggt atggggttaa 3660 
agcatggaac aggggagagt cttctgtgcc tggtttcttc tcctctatct ecataatgea 3720 
ttatgggccc gaggaatagg ggagggttaa taagactcca accctaatgg cccaacaggg 3780 
aaattctcat tttggtcgat gatattctga tggactggtt tggtcttaat accagtcaac 3840 
cgttgtcctt ctggaaatat acatatatga aataacataa eggtaacact tggcagccaa 3900 
gttccctggt ttctgggact tcccatctta cccattcctt ttccagggct tccagtgtcc 3960 
tgatacttct ggagggtggt tcatactcca aattagattc tggggagtta cagagtaatt 4020 
ttttccttgg nagggaaagg gaagggttgg gggatggatt taggcagagt tccggtggaa 4080 
acattatttg cactctgaga taagatccaa gcctggagtt tgcagaagat actgtcctaa 4140 
taagcaggca tttctaaacc aagtatctaa gcctaagcac agcttgtcct gggtgaaatg 4200 
tctgccacaa aagatagttt ctcctagctc agacttaacc atttataaag gttggtaaaa 4260 
tactggcagt gacaacaaat tgacttttta attttcttat ttgeattatt ccaataaatg 4320 
aaaatctgtc agagttcg 4338 

<210> 97 

<211> 1827 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 331074 . 1 :2000MAY01 
<220> 

<221> unsure 
<222> 32, 57 

<223> a, t, c, g, or other 
<400> 97 

cctacaccat ggaggtgctg ctgeagaate aneggctgeg ggaccacaat ateeggnegg 60 
gcgaggatag atgtgctcac gcaagaaggc tgagtttatc aagggcagtc acaagtgcaa 120 
cgtttgttca eggactttet teteggagaa egggctaegg gagcacctgc agacgcaccg 180 
gggccctgcc aagcactaca tgtgtcccat ctgtggtgag cgcttccctt cgctgctgac 240 
gctcaccgaa cacaaggtga cccacagcaa gagectggae acgggcacct gtcgcatctg 300 
caagatgccc ctgcagagcg aggaggagtt tattgagcac tgccagatgc accctgacct 360 
gcgcaactca ctcacgggct tccgctgtgt ggtctgcatg cagacagtca cttccacgct 420 
tgagctcaag atccatggca ccttccacat gcagaagctg gegggcaget cagcggcgtc 480 
ctcccccaat ggecagggge tgeagaaget ctacaagtgc gccctgtgcc tcaaggagtt 540 
ccgcagcaag caggacctgg tgaagcttga cgtcaatggg ctgccctacg gcctctgcgc 600 
eggctgeatg gcccgcagag ccaacggaca ggtgggtggc ctggccccgc ccgagcccgc 660 
cgaccggccc tgtgccggcc tccgttgccc cgagtgcagt gtcaagtttg agagtgeega 720 
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agacctggag agccacatgc aggtggacca ccgtgacctc acgccggaga ccagtgggcc 780 
ccggaaaggc acccagacat cgccagtgcc ccggaaaaag acataccagt gcatcaagtg 840 
ccagatgacc ttcgagaacg agagagagat ccaaatccac gatgccaacc acatgattga 900 
ggaaggcatc aaccacgagt gtaagctgtg caaccagatg ttcgactccc cggccaagct 960 
cctctgtcac ctcattgagc acagcttcga gggcatgggc ggcaccttca aatgccccgt 1020 
gtgtttcaca gtcttcgtcc aggccaacaa gttgcagcag cacatctttg ccgtgcacgg 1080 
gcaggaggac aagatctacg actgctcaca gtgccctcag aagttcttct tccagaccga 1140 
gctgcagaac cacacgatga gccagcacgc acagtgaggg atcgctcaac aggacacctc 1200 
tccgcagaag gcttgccgga gacgccgtgg ggagggccat ttgaacatta catccaatca 1260 
aagtgtcatt tgcaacccag atgtaaaact ctaatgattt ggccatgagg cgctgctatt 1320 
ataagcagct ggaaatgaat attaatggca gagattaaaa gtattccatg ctcagtattt 1380 
tttattgtcc tgctacagct agtgtgcttt tagactttcc gccgcagact acatttctag 1440 
agttagagaa acctgctttt taaggctatt gtcctttgtt ccttcatgta ttatattgat 1500 
agtttttaaa aaagaattag tgtgattttt tttctttgct tctttttttt ctttcttgtt 1560 
tttcttcccc cccaccccct acccccttcg gttaactact ttttaattgc aattctaggt 1620 
aattgtgcat cgtgatgtga ttgcttggct attgtctgaa tatttccttt taatttttta 1680 
attaaagact aatgctttga ttggatttgc cagttcaccg gacagtgatt aaaactatgt 1740 
aatgaatata atcggtttca gtgcaactgg atggtctgct tttaaatgtg acttaatctg 1800 
actgcagtaa ctagtacagt tcaataa 1827 

<210> 98 

<211> 825 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 1170349 . 1 : 2000MAY01 
<400> 98 

gccgctgctt caggtcggcg ggctccaggg agacctgggt gggcagcgtc gccgtttctc 60 
ctttcttggg cagtattttt cccagcgcca cgcggaggct gggccattat gagctctgca 120 
tttccaggac ctggtcacta ttcaggacac ggttccagcg cagtgcgcgt tagccatgtc 180 
tcagggatga gtgacattcc aagatgtggc cattgacttc tccaaggaag agtggggatt 240 
cctgaaccct gctcagagag atttgtacac aactgtgatg ctggagaatt atcagaacct 300 
ggtctggctg ggactttcca tttctaaatc tgtgatttca ctgttggaga aaaggaaact 360 
gccttggata atggcaaaag aagagataag aggcccattg ccagatgtgc caggtgcaga 420 
gattaaggag ttatctgcaa agagggctat taatgaagta ttatcgcagt ttgacacagt 480 
gataaaatgt acaagaaacg tatgtaagga atgtggaaat ctatactgcc acaatatgca 540 
gcttactctc cataagagaa atcatacaca aaagaaatgc aatcagtgtt tagattgtgg 600 
gaaatacttc actcgtcaat cacctctcat tcagcatcaa agaatccaca cgggagagag 660 
accctataaa tgtaacgaat gtattaaaac cttcaaccag agggcacacc ttacctagca 720 
tgagagaatt cacactggtg agaaacctta caaatgtaag gaatgcagga aaaccttcag 780 
ccagatgact catctcacac agcatcagac tacacatatg agaga 825 

<210> 99 
<211> 1100 
<212> DNA * 
<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: LG:335097 . 1:2000FEB18 
<400> 99 

tcatctgtgc tgtgaattat gtgctgtatt gactctcaag catgatgacc attggtggtt 60 
tctgtaccat ttcctgttac tttactgaaa cacacctact ccattaactt cttgggttaa 120 
gtctagaaag taacagttta cttgtaaacc acatttctta tccccaataa gtattttttt 180 
aagattatta aagttcatta ttactaccct atgatgtgaa agtgtcattt gcttaatctt 240 
tttaattttt tattctcaac ctcatcttac tgaagagaat aaaactcttt taccatattc 300 
ttaaaatgtg gaattctcgg ccaggtgcag tgctcacgcc tgtaattcca tcactttggg 360 
aggccaaggt gggtggatca tctgaggtca ggagttcaag accagcctgg ccaacatggt 420 
gaaaccccgt ctctactaaa aatacaaaaa ttatctgggt gtggtggcgc gtgcctgtag 480 
gcccagctac tcaggaggct gaggcaggag aattgcttga acccaagagg tggaggttgc 540 
agtgagccta gattgctgcc actgcactcc agcctgggtg acagcagaac tctgtctcaa 600 
aaaaaagatg tggaattctt ttctgcaaat gttctctaat agtatacctt cttcagtctg 660 
tcgatatatg tatgctatta ttttacaagt aatacatgtt gattgtattg gaaattatag 720 
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aaaagattat attggattgt ttagaaaata tttttaaatg tgaagaaaaa tataaaaatt 780 
actcccttgt tccactttcc ccactctcaa gtcagactat gttgttttca tagttagtag 840 
ctagcagtct accccactag attatatgct tcacagaggg aagggaccct caagacttca 900 
ctggattgag tagcacccaa taccttgctt gctgcctggt ttgtgatggg catactgtaa 960 
gaaaaaaaaa tctgaatgac aaaatgtttt tccataatac cagacttcct cttgaagaga 1020 
tgggtcgtaa tgttgtagtc ttacatgctt acgtagacga tcaaagcaag aatactcaat 1080 
aaatggctat ttaccacttg 1100 

<210> 100 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_J:eature 

<223> Incyte ID No: LG: 1076451 . 1 :2000FEB18 
<400> 100 

gtgtacttac tttctgccat 
taatcttatt actattaatg 
ccccattttt cttctgtttt 
tcctcagttc ttaatcttat 
cacacattag gttggtcatt 
caagttcttt ttagagttct 
accttttgtc ctacctcagt 
aggtcagttg aagtccttac 
agttttctca tgcttcagcc 
gcttgtcgac ttcttcagag 

<210> 101 
<211> 392 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LI : 805478 . 1 :2000FEB01 

<220> 

<221> unsure 
<222> 21 

<223> a, t, c, g, or other 
<400> 101 

gccaacccct gcaaagtgac nctgaagaag acgacaagct ctgctccagt cacacccaga 60 
agctgactgg tccacgcacg gccgaagcat gagaaaactc attgcggaac tcattttcct 120 
taaaatttgg acttttacag taaggacttc aactgacctt cctcagactg aggactgttc 180 
cagtgtatac atcaagtcac tgaggtagga caaaaggttg ctacagtcct attattttat 240 
ggttattata agtgtactgg aactctaaaa ataacttgtt tgtataatgt tattctgtac 3 00 
aaggtatgta gcccaggaag tgaccaacct gatgtgtgtt atgacccatc tgagcttccc 3 60 
atgaccacgg tttttaaaat aagattaagg ac 392 

<210> 102 
<211> 856 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> IncytelDNo: LG: 101269 . 1 : 2000MAY19 
<400> 102 

ctctccttta ttattaacta aagagcattc tctaccattg cccagctgaa 
tcatagattc tctgtgggga gagtgaataa agaatggggg attgaggcac 
aatagtgaga cacaggattc taactggcag actggcactt tatctcaatt 
gtgttgcctc tcactttaaa gacatccaca ttccaccctg gccttgcccg 
ccaatctctg ctggtcagtc acgcagttga ggtcagggaa ggagtaactc 



tatatcctct ttcttaataa 
acagcacagg catcagattt 
ggctaacact ttactcgtat 
ttcaaaaact gtggtcatgg 
tcctgggcta tgtaccttgt 
agtacactta taataaccat 
gacttgatgt atacactagg 
tgtacaagtc caaatgttaa 
gtgcgtggac cagtcagctt 
ttactttgca ggagttggcg 



agggaaccac atcatattcc 60 
caaggtgact tgtatgggca 120 
tgtttatgag cccccatcag 180 
gaggctcaga tgggtcataa 240 
atagaataac attatacaag 300 
aaaataatag gaccatagca 3 60 
aacagtcctc agtccaagga 420 
ggaacatgag tcccgcgatg 480 
ccgggtgtga ctggagcagg 540 
aagc 584 



agcacctagc 60 
agagtgttaa 120 
cttaactgct 180 
tctggtgagc 240 
aagatgattt 300 
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tgatgttgca gttttaaatt tcactttgac ttgagctggg aaatcctttg gcctattata 360 
ccatggatgc taaaaatggt tctaactgaa aacccaaacc aagaaatagc aacaagtcta 420 
gaattcttac tactacaaaa ctcacctgga tccctaaggg cacagcaaag aatgagctat 480 
tacggcagca gctatcatat tatcaatgcg gacgcaaaat acccaggcta cccgccagag 540 
cacattatag ctgagaagag aagagcaaga agacgattac ttcacaaaga tggcagctgt 600 
aatgtctact tcaagcacat ttttggagaa tggggaagct atgtggttga catcttcacc 660 
actcttgtgg acaccaagtg gcgccatatg tttgtgatat tttctttatc ttatattctc 720 
tcgtggttga tatttggctc tgtcttttgg ctcatagcct ttcatcatgg cgatctatta 780 
aatgatccag acatcacacc ttgtgttgac aacgtccatt ctttcacagg ggcctttttg 840 
ttctccctag agaccc 856 

<210> 103 
<211> 856 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 331087 . 1 :2000MAY01 
<400> 103 

ccctctctct ttctgccctg tagccgcggg cgtccaaatg aagctggaat tcctccagcg 60 
caaattctgg gcggcaacgc ggcaggatct ggactgcttc gtcatcgaca acaacgggtt 120 
cattctgatc tccaagaggt cccgagagag tgactatgta tgactatcag gccatgtgca 180 
aaccctcgag tcaccaccac agtgcagccc agcecctggt cagcccaatt tctgccttct 240 
tgacggcgac caggtggctg ctgcaggagc tggtgctgtt cctgctggag tggagtgtct 3 00 
ggggctcctg gtacgacaga ggggccgagg ccaaaagtgt cttccatcac tcccacaaac 360 
acaagaagca ggacccgctg cagccctgcg accacggagt accccgtgtt tcgtgtacca 420 
gccggccatc cgggaggcca acggaatcgt ggagtgcggg ccctgccaga aggtatttgt 480 
ggtgcagcag attcccaaca gtaacctcct cctcctggtg acagacccca cctgtgactg 540 
cagcatcttc ccaccagtgc tgcaggaggc gacagaagtc aaatataatg cctctgtcaa 600 
atgtgaccgg atgcgctccc agaagctccg ccggcgacca gactcctgcc acgccttcca 660 
tccagaggtg cgggttgagg cggatcgagg gtgggctgga ttttcatccc caaaccctct 720 
gtgcctgggt ctgtgcccct gcagacagga gcatataggg atgccaatga acacacctgt 780 
gcctgtgctt ctcgggggaa acattcgcgt ttatgccctg tgacactgtg atataataag 840 
aaacagatct ttattt 856 

<210> 104 

<211> 8530 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:410188 . 1 :2000MAY01 
<220> 

<221> unsure 

<222> 6797, 7046, 7252, 7311, 7325-7326, 7502-7503, 7563 
<223> a, t, c, g, or other 

<400> 104 

gccgctgagc ctgcgcccag tgccccgagc 
cagccgccca ccccggggcc cggccggggg 
agagcctgtg cccagaagca ggatgagaag 
agcagcttcc gcaggttcac acgggagtcc 
aagcaagccc gcggctcaac caccttgcag 
gctccccggc cccagctgga cctgcaggcc 
ccaccccaag age teat egg agagcccctg 
aagactttca tegtactgaa taaaggcaag 
ttgtatgtcc tcagtccctt ccacccagtt 
tcgctcttca acatgetcat catgtgcacc 
cacgaccctc caccctggac caagtatgtc 
gagtctctgg tcaagattct ggctcgagct 
gacccatgga actggctgga ctttagtgtg 
gacctgggca atgtctcagc cttacgcacc 
tcagtcattt cagggctgaa gaecategtg 



cccgcgccga gccgagtccg cgccaagcag 60 
accagcagct tccccacagg caaegtgagg 120 
atggcaaact tcctattacc tcggggcacc 180 
ctggcagcca tegagaageg catggeggag 240 
gagagecgag aggggctgee cgaggaggag 300 
tccaaaaagc tgecagatet ctatggcaat 360 
gaggacctgg accccttcta tagcacccaa 420 
accatcttcc ggttcagtgc caccaacgcc 480 
eggagagegg ctgtgaagat tctggttcac 540 
atcctcacca actgcgtgtt catggcccag 600 
gagtacacct tcaccgccat ttacaccttt 660 
ttctgcctgc acgcgttcac tttccttegg 720 
attatcatgg catacacaac tgaatttgtg 780 
ttccgagtcc tccgggccct gaaaactata 840 
ggggccctga tccagtctgt gaagaagctg 900 
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gctgatgtga tggtcctcac agtcttctgc 
ctcttcatgg gcaacctaag gcacaagtgt 
aacggctccg tggaggccga cggcttggtc 
ccagaaaatt acctgctcaa gaacggcacc 
gacgctggga catgtccgga gggctaccgg 
ggctacacca gcttogattc ctttgcctgg 
caggactgct gggagcgcct ctatcagcag 
atcttcttca tgcttgtcat cttcctgggg 
gtggtcgcaa tggcctatga ggagcaaaac 
gaaaagcgct tccaggaggc catggaaatg 
aggggtgtgg ataccgtgtc ccgtagctcc 
agccatgaga gaagaagcaa gaggagaaaa 
gaggacaggc tccccaagtc tgactcagaa 
ctcacccgtg gcctcagcag gacttctatg 
acctttcgca ggcgagacct gggttctgaa 
gcgcgggaga gcgagagcca ccacacatca 
agtgcccagg gacagcccag tcccggaacc 
aagaacagca ctgtggactg caatggggtg 
gccacatccc caggaagcca cctcctccgc 
accacgccat cggaggagcc aggcggcccc 
gatggcttcg aggagccagg agcacggcag 
agcgcactgg aagagttaga ggagtctcgc 
gcccagcgct acctgatctg ggagtgctgc 
aagttggtgg tcatggaccc gtttactgac 
acactcttca tggcgctgga gcactacaac 
gtcggaaacc tggtcttcac agggattttc 
ctcgacccct actactactt ccaacagggc 
cttagcctca tggagctggg cctgtcccgc 
cgcctgctgc gggtcttcaa gctggccaaa 
atcatcggga actcagtggg ggcactgggg 
ttcatctttg ctctggttgg caagcagctc 
aatatctccg cgccccatga agactggccc 
ttcctcattg tcttccgtat cctctgtgga 
gaagttggcc aaaaatccat atgcctcatc 
ctggtggtgc ttaacctgtt catcgccctg 
acagccccgg aggacgatgg ggaggtgaac 
aggggcctgc gctttgtcaa gcggaccacc 
cggcctcaga agcccgcagc ccttgccgcc 
ccctactccc cgccaccccc agagacggag 
tttgaggaag gcgagcaacc aggccagggc 
cccatcgctg tggccgagtc agacacagat 
ggcacggagg aggagtccag caagcagcag 
ggccctccgg attccaggac ctggagccag 
gccagtgcat ctcaggccga ctggcggcag 
tgcggtgaga ccccagagga cagttgctcc 
gctgagctcc tggagcagat ccctgacctc 
ttcactgaag gctgtgtccg gcgctgtccc 
gggaaggtct ggtggcggtt gcgcaagacc 
gagacattca tcatcttcat gatcctactc 
tacctagagg agcggaagac catcaaggtt 
tatgtcttcg tgctggagat gctgctcaag 
accaatgcct ggtgctggct cgacttcctc 
gccaacaccc tgggctttgc cgagatgggc 
ctccgtcctc tgagagctct gtcacgattt 
gtgggcgcca tcccgtccat catgaacgtc 
ttcagcatca tgggcgtgaa cctctttgcg 
gatggagagt tttcccttgt acctttgtcg 
caaaactcca ctggcagctt cttctgggtc 
atgggttacc ttgcacttct gcaggtggca 
gcagctgttg attcccggga ggtcaacatg 
tatttgtact ttgtcatctt catcattttt 
ggggtcataa ttgacaactt caatcaacag 
atgacagagg agcagaagaa atactacaat 
cagaagccca tcccacggcc cctgaacaag 
agacaagctt ttgacatcac catcatggtc 
ggtggagact gatgaccaaa gtgaagaaaa 
ctttgtggcc gtcttcacag gcgaatgtgt 



ctcagcgtct ttgccctcat cggcctgcag 960 
gtgcgcaact tcacagcgct caacggcacc 1020 
tgggaatccc tggaccttta cctcagtgat 1080 
tctgatgtgt tactgtgtgg gaacagctct 1140 
tgcctaaagg caggcgagaa ccccgaccac 1200 
gcctttcttg cactcttccg cctgatgacg 1260 
accctcaggt ccgcagggaa gatctacatg 1320 
tccttctacc tggtgaacct gatcctggcc 1380 
caagccacca tcgctgagac cgaggagaag 1440 
ctcaagaaag aacacgaggc cctcaccatc 1500 
ttggagatgt cccctttggc cccagtaaac 1560 
cggatgtctt caggaactga ggagtgtggg 1620 
gatggtccca gagcaatgaa tcatctcagc 1680 
aagccacgtt ccagccgcgg gagcattttc 1740 
gcagattttg cagatgatga aaacagcaca 1800 
ctgctggtgc cctggcccct gcgccggacc 1860 
tcggctcctg gccacgccct ccatggcaaa 1920 
gtctcattac tgggggcagg cgacccagag 1980 
cctgtgatgc tagagcaccc gccagacacg 2040 
cagatgctga cctcccaggc tccgtgtgta 2100 
cgggccctca gcgcagtcag cgtcctcaca 2160 
cacaagtgtc caccatgctg gaaccgtctc 2220 
ccgctgtgga tgtccatcaa gcagggagtg 2280 
ctcaccatca ctatgtgcat cgtactcaac 2340 
atgacaagtg aattcgagga gatgctgcag 2400 
acagcagaga tgaccttcaa gatcattgcc 2460 
tggaacatct tcgacagcat catcgtcatc 2520 
atgagcaact tgtcggtgct gcgctccttc 2580 
tcatggccca ccctgaacac actcatcaag 2640 
aacctgacac tggtgctagc catcatcgtg 2700 
ctaggggaaa actaccgtaa caaccgaaaa 2760 
cgctggcaca tgcacgactt cttccactct 2820 
gagtggattg agaacatgtg ggcctgcatg 2880 
cttttcttga cggtgatggt gctagggaac 2940 
ctattgaact ctttcagtgc tgacaacctc 3000 
aacctgcagg tggccctggc ccgcatccag 3060 
tgggatttct gctgtggtct cctgcggcac 3120 
cagggccagc tgcccagctg cattgccacc 3180 
aaggtgcctc ccacccgcaa ggaaacacag 3240 
acccccgggg atccagagcc cgtgtgtgtg 3300 
gaccaagaag aggatgagga gaacagcctg 3360 
gaatcccagc ctgtgtccgg ctggcccaga 3420 
gtgtcagcga ctgcctcctc tgaggccgag 3480 
cagtggaaag cggaacccca ggccccaggg 3540 
gagggcagca cagcagacat gaccaacacc 3600 
ggccaggatg tcaaggaccc agaggactgc 3660 
tgctgtgcgg tggacaccac acaggcccca 3720 
tgctaccaca tcgtggagca cagctggttc 3780 
agcagtggag cgctggcctt cgaggacatc 3840 
ctgcttgagt atgccgacaa gatgttcaca 3900 
tgggtggcct acggcttcaa gaagtacttc 3960 
atcgtagacg tctctctggt cagcctggtg 4020 
cccatcaagt cactgcggac gctgcgtgca 4080 
gagggcatga gggtggtggt caatgccctg 4140 
ctcctcgtct gcctcatctt ctggctcatc 4200 
gggaagtttg ggaggtgcat caactatacc 4260 
attgtgaata acaagtctga ctgcaagatt 4320 
aatgtgaaag tcaactttga taatgttgca 4380 
acctttaaag gctggatgga cattatgtat 4440 
caacccaagt gggaggacaa cgtgtacatg 4500 
ggaggcttct tcacactgaa tctctttgtt 4560 
aaaaaaaagt tagggggcca ggacatcttc 4620 
gccatgaaga agttgggctc caagaagccc 4680 
ttccagggtt ttgtctttga catcgtgacc 4740 
ctcatctgcc taaacatggt tcaccatgat 4800 
gacgaaaatt ctgggcaaaa tcaaccagtt 4860 
catgaagatg ttcgctttga ggcagtacta 4920 
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cttcacaaat ggctggaatg tgtttgactt 
gattttttct gcaattctta agtcacttca 
catccgcctg gcccgaattg gccgcatcct 
cacactgctc tttgccctca tgatgtccct 
attccttgtc atgttfcatct actccatctt 
ggaggctggc atcgacgaca tgttcaactt 
cttccagatt accacgtcgg ccggctggga 
gcccccctac tgtgacccca atctgcccaa 
cccagccgta ggcatcatct tcttcaccac 
caacatgtac attgcagtga ttctggagaa 
gcctctgagt gaggacgact ttgacatgtt 
ggccactcag tttattacct tttctgctct 
cctgagaatc ccaaaaccca atcgaaatat 
tggagataag atccactgct tggacatcct 
atccggggag ttggattctc tgaaggcaaa 
atccaagatc tcctacgagc ccatcaccac 
ggccatggtt atccagagag ccttccgcag 
ctccttcctc ttccgtcagc aggcgggcag 
agagggcctc atcgcctacg tgatgagtga 
cagctcctcc atctcctcca cttccttccc 
cagcgataac ctccaggtgc gggggtctga 
ccccccttct ccggacaggg accgtgagtc 
gacacactga aaagcagcct ttttcaccat 
agcctggggc cttcctggct ttgggagtaa 
ggcagagtte tgtggcgccg cgtggacagc 
agatagacct gtgacctggt ctggtcaggc 
catcccagct gctgaggcga aatataaaac 
ataaatttat tatatttgat atttttttac 
acatgggcag caattcacgc tgtctcttct 
ccggaagtcc tgttctcaaa gcagaagtgg 
gcccaggggt cgaatggggt ccccctccca 
cccactcaca caagcanaca cacacagtcc 
gacccaggct cccagcctaa gggagacagg 
ttcttgtcca cggggctcac tctggccccc 
tgtgttttca cgcaggtcat attgtcagtc 
tggcctgggg tcccagggct gggccntagg 
taagagtatt aacaaaacca aaggacacaa 
cagcttgtcc ttgctcctgg aacctggcag 
get gage cat gggggtttgg ggctaagaag 
ctgcctgaag agaggaaact ggcgatctcc 
tttcnngtgt ggtctccagc tcctctccag 
ttggccctgg ccctgtgcag ggaaagggat 
gggaatgaac catggcacct ttgagagagg 
cnnagtcccg ggatcctgat ggcacccaca 
ggntcccaba ggtcttgtga aaaggctttt 
cccccaggac ctcttcagct gctgacaatc 
gagaactgtc accctgaggt aacagggtca 
tggctgcccc attccagctc tcccacggag 
acctcagtct cacctgccgg ctctgggctg 
ggagggctgg acatttgtgg cagtgctgaa 
tttcttcttg tcaccccagt tcccttggtg 
cagatctagt tctcttctcc tgtgttccaa 
ctgtccaagc agagatttct ccaagtggaa 
gctctcaagg cttggagagg tatgtcccac 
ggcaggtaat gcggattgga ggcaggtegg 
gtggtttttg tgtgtgtgga cagcccggga 
cagacggtgt ggttgggggt gggaggtgag 
tgtcaccttt ttaatttcca gaactgeaca 
aaaaaggagg gaaggaaaga catcaagtgc 
tcaccaaact tcatgtataa ataaaataca 
catgacctgg 

<210> 105 

<211> 2965 * 

<212> DNA 

<213> Homo sapiens 



cattgtggtg gttctctcca ttgegagect 4980 
aagttacttc tccccaacgc tcttcagagt .5040 
cagactgatc cgagcggcca aggggatccg 5100 
gcctgccctc ttcaacatcg ggctgttgct 5160 
eggtatgtec agctttcccc atgtgaggtg 5220 
ccagaccttc gccaacagca tgctgtgcct 5280 
tggcctcctc agccccatcc tcaacacagg 5340 
cagcaatggc accagagggg actgtgggag 5400 
ctacatcatc atctccttcc tcatcgtggt 5460 
cttcaatgtg gecaeggagg agagcactga 5520 
ctatgagacc tgggagaagt ttgacccaga 5580 
cteggacttt gcagacactc tctctggtcc 5640 
actgatccag atggacctgc ctttggtccc 5700 
ttttgettte accaagaatg tcctaggaga 5760 
tatggaggag aagttcatgg cagccaaccc 5820 
cacactccgg cgcaagcacg aagaggtgtc 5880 
gcacctgctg caacgctctt tgaagcatgc 5940 
cggcctctcc gaagaggatg cccctgagcg 6000 
gaacttctcc cgaccccttg gcccaccctc 6060 
accctcctat gacagtgtca ctagagccac 6120 
ctacagccac agtgaagatc tcgccgactt 6180 
catcgtgtga gcctcggcct ggctggccag 6240 
ggcaaaccta aatgeagtea gtcacaaacc 6300 
gaaatgggcc tcggccccgc ggatcaacca 6360 
eggagcagtt ggcctgtgct tggaggcetc 6420 
aatgcccctg cggctctgga aaagcaactt 6480 
tgagactgta tatgttgtga atgggctttc 6540 
ttgagcaaag aactaaggat ttttccatgg 6600 
taaccctgaa caagagtgtc tatggagcag 6660 
aatccagtgt ggctcccaca ggtcttcact 6720 
cttgacctga gatgctggga gggctgaacc 6780 
tcacacacgg aggecagaca caggccgtgg 6840 
cctttccctg ccggcccccc aaggatgggg • 6900 
tattgtctcc caaggtccca ttttccccct 6960 
ctacaaaaat aaaaggcttc cagaggagag 7020 
cactgatagt tgecttttet tcccctcctg 7080 
gggtgcaagc cccattcacg gcctggcatg 7140 
gccctgccag ccagccaatg gaagagaggg 7200 
ttcaccagcc ctgagccatg gnccctcagc 7260 
cagggctctc tggaccatac neggaggagt 7320 
acacagagac atgggagtgg ggagcggacg 7380 
ggtcaggccc agttctcgtg ccccttagag 7440 
gggcactgtg gtcaggccca gcctctctgg 7500 
cagaggacct ctttggggca agatccaggt 7560 
tcagggaaaa atattttact agtccaatca 7620 
ctatttagca tatgeaaate ttttaacata 7680 
actggcgaag ctgaggcagg ccagggggct 7740 
cccctccaac egggegcatg ctcccaggcc 7800 
gctgctccta acctacctcg ccgagctgtc 7860 
gggggcattg ccggcgagta aagtattatg 7920 
gcaaccccag acccaaaccc atgcccctga 7980 
tttgagtcca agtgtgggac aaggttttaa 8040 
atccatattt ttgtgagatc tecatgettt 8100 
ctccatgggt tgctcaccgc ctgctacaca 8160 
gctgccagcc cagctggccg gaaggagact 8220 
gctttgagac aggtgcctgg gggctggctg 8280 
ctagacccaa cccttagctt ttagcctggc 8340 
atgaccagca ggaggggaga agagagtagg 8400 
cagatgttgt ctgaactaat cgagcacttc 8460 
tatttttaaa acaaaccaat aaatggctta 8520 

8530 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1188288 .1 :2000MAY01 
<400> 105 

ctgaggtggc gtgggcgccc tgcgggccgg gcctccgcgg cgcgtccagt cccagtctcc 60 
gcgctggtct ggcccggtcc ctcagctcca ccccctgcct ccccctcccg cccgctcgcg 120 
cctcctgaaa agccagctcg cccggggcgg cggcgcagag ccgggccggc gcacgaggca 180 
gccagcgcgg ccatgacggc ggagaaggcg ctgcccctgg gcaatgggaa ggctgctgag 240 
gaggcgcggg agtccgaggc gccgggtggc ggctgcagca gcgggggcgc ggcgcccgcg 300 
cgccacccgc gcgtcaagcg cgacaaggcg gtccacgagc gcggccactg gaacaacaag 360 
gtggagttcg tgctgagcgt ggccggggag atcattgggc ctgggcaacg tgtggcgctt 420 
cccctacctt gtgctacaag aacggaggag gggcattcct gattccctac gtggtgtttt 480 
ttatttgctg tggaattcct gtttttttcc tggagacagc tctggggcag ttcacaagtg 540 
aaggtggcat tacgtgttgg aggaaagttt gccctttatt tgaaggcatt ggctatgcaa 600 
cacaggtgat tgaggcccat ctgaatgtgt actacatcat catcctggca tgggccattt 660 
tttacctgag caactgcttc actactgagc taccctgggc tacctgtggg catgagtgga 720 
acacagagaa ttgtgtggag ttccagaaac tgaatgtgag caactacagc catgtgtctc 780 
tgcagaaatg ccacctcccc tgtcatggag ttttgggagc accgggtcct gggcatctct 840 
gacgggatcg agcacatcgg gaaccttcgc tgggagctgg ccttgtgtct cttggcagcc 900 
tggaccatct gttacttctg tatctggaag gggaccaagt ctacaggaaa ggttgtatac 960 
gtgactgcga cattccccta catcatgctg ctgatcctcc tgatacgagg ggtcacgttg 1020 
cccggggcct cagagggcat caagttctac ttgtaccctg acctctcccg gctctccgac 1080 
ccccaggtct gggtagatgc tggaacgcag atctttttct* cctatgccat ttgcetgggc 1140 
tgtctgaccg ctctgggaag ttataacaat tataacaaca actgctacag ggactgcatc 1200 
atgctctgtt gcctgaacag cggcaccagc tttcgtggct gggtttgcca tcttctcagt 1260 
cctgggtttt atggcgtacg agcagggggt acccattgct gaggtggcag agtcaggccc 1320 
cggtctggcc tttattgcgt accccaaggc ggtcaccatg atgcccctct ccccgctgtg 1380 
ggccaccttg ttcttcatga tgctcatctt cctgggcctg gacagccagt ttgtgtgtgt 1440 
ggaaagcctg gtgaccgccg tggtggacat gtaccccaat ggttttcccg gaggggttac 1500 
cggcgggagc tgttcatcct agccttgtct gttatctcct attttctggg cctcgtgatg 1560 
ttaacagagg gtggcatgta catcttccag ctctttgact cctatgccgc cagtgggatg 1620 
tgccttctct tcagcggcca tctttgagtg catctgcatc ggctgggtgt atgggaagca 1680 
accggttcta tgataacatt gaagacatga ttggctaccg gccaccgtcg ctcattaagt 1740 
ggtgctggat gatcatgacc cctgggatct gacgacgggg atcttcatgc ttgcttcttg 1800 
atcaaggtac aagaccactg caaggtacaa caacatctac acctacccag cctggggcta 1860 
atggcattgg ctggctcatg ggccctgtcc tccatgctct gcatcccgct ctggatctgc 1920 
atcacagtgt ggaagtacgg aggggacgct gcccgagaaa ctccagaagt tgacgacccc 1980 
cagcacagat ctgaaaatgt cggggcaaga cttggggtga gcccacggat ggtgacagtt 2040 
aatgactgtg atgccaaact caagagtgac gggaccatcg cagccatcac agagacagga 2100 
gaacgccact tctgagcggg ccaaaccaaa gccatcttgg cggctccttc ttcctttctt 2160 
ccccccgtgt atgtaaatga attcctgaac cccatacttc acctaatggt aggggcttgc 2220 
ttgttttgca caggattaat taacaagtta attttaaggt ggccactgta cactgctcta 2280 
aagtcatatc ccctccccgc cccccagttc atcatggaag taaccactac aaagagataa 2340 
cactgtacgg ggactgccag atcttctgtc ctagggcagt tttaatcaat gttttctagg 2400 
gattaggaag aagtgtataa tattgtaaaa ctttgtatta ctgggggttg tgggccaggt 2460 
ggggtgggga gtgatggctg tcctcctcgt gtcggccttt gtaatgaccg ttctgtgtcc 2520 
cagtgtatgt ctctgaccat ccacagccgc ccctcctcca tacccccggg ataaattttc 2580 
cctcaaaccc tccccttcca caagggacac tggggttggc agacagcaag gctattgtgg 2640 
cagagaagaa aaagtccagc agactcagga ctcctgggct gcaggccccc ctctggcctg 2700 
atcccatgaa ggtgaccttg ggaactccct ttccctccct gggcctcagt ttccccctct 2760 
ccccaatgag aggaggagcc cgattggggt atttcttagg gatatctctt tgtgtgccta 2820 
tgattgcttc cagacctgcc gtggaccttg gcttccttct gaagacttgc aggccattat 2880 
ctccgccctc cccagttgta cctcctcccc acctgcttat ttaaaatcca gtttcttgtc 2940 
gaaacacaca caaaacacac acaaa 2965 

<210> 106 
<211> 2376 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 427 997 .4 :2000MAY01 
<400> 106 

cgcgtacagg aggcggcggc ggctcccagt caccggcccc cgacggcgag cgcacgatgc 60 
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actgcctggg cgccgagtac ctggtttctg ccgacggcgc ccctaggcaa agggagtggc 120 
gaccccagat ttataggaaa tgcacagata cggcatggtt attcctgttc tttctctttt 180 
ggactggttt ggtgtttatc atgggctact cggtggtggc tggagccgcg ggaagactcc 240 
tctttggcta tgacagcttt ggcaacatgt gtggcaagaa gaactccccc gtggaagggg 300 
cccctctttc agggcaggac atgaccctaa aaaaacacgt gttctttatg aattcctgca 360 
acctggaagt caaaggtacg cagctcaacc gcatggccct ctgtgtatcc aactgccctg 420 
aagagcagct tgactccctg gaagaggtcc agttctttgc aaacaccagt gggtccttcc 480 
tgtgtggtta tagtttgaat tccttcaact atacccacag tccaaaagca gactcactgt 540 
gtcccaggct accagttcct ccaagcaagt catttccctt atttaaccga tgtgtccctc 600 
aaacacctga gtgctactcc ctatttgcat ctgttttgat aaatgatgtt gacaccctcc 660 
accgaattct aagtggaatc atgtcgggaa gagatacaat ccttggcctg tgtatcctcg 720 
cattagcctt gtctttggcc atgatgttga ccgttcagat tcatacacca cccttctggt 780 
tcacattttc atttcattgg ttattttggg attgttgttt agtctgcggt gttttatggt 840 
ggctgtatta tgactatacc aacgacctca gcatagaatt ggacacagaa cagggaaaat 900 
atgaagtgcg tgctggggta tgctatcgta atccacaggg catcacggca gatgctgctc 960 
gtcttgatat tttgttctca gaaagagaat aaaattgaca agttgagctt ttccaaatca 1020 
ccaaataaag ccaatcagca gttggctccc ttcctgggct ggttccaagc caactgggtg 1080 
ggacgatttg gccatcctca ttgttcttcg tgggtcctct gggtggctgt gctgctgagc 1140 
cttgggaact gcagggagct gcccaggtta tgggaaggcg gccaagtgga atataagccc 1200 
ctttcgggca ttccggtaac atgtgggtcg taccatttaa ttggcctcat ctggactagt 1260 
gaattcatcc ttgcgtgcca gcaaatgact atatgctggg gcagtggtta cttgttattt 1320 
caacagaagt aaaaaatgat cctccgtgat catcccagcc gtttcgtctc gtccccattc 1380 
tcttcttcta ccatccaagg aaccattgtg aaagggtcat ttttaatctc tgtggtgagg 1440 
attccgagaa teat tgt cat gtacatggca acaacgcagc tgacaagata cagcateggt 1500 
geattegtae caggtacctg ttccggatga ctgettaegg ctgttcctgg tgtctggaca 1560 
aaatacctcg ctccatctca agecagaatg catatactac aateggecta ttacatggga 1620 
cagatttcgt gtacatcagc aaaagatgea ttcaaaatct tgtccaagaa ctcaagtcac 1680 
tttacatcta ttaatctget ttggagactt cataattttt ctaggaaagg tgttagtgct 1740 
gtgtttcact gttttttgga ggactcatgg cttttaacta caategggea tttccaggtg 1800 
tggggcagtc cctctgttat tggtagcttt ttttggccta cttagttagc ccatagtttt 1860 
tttatctgtg tttgaaactg tgctggatgc acttttcctg tgttttgctg ttgatctgga 1920 
aacaaatgat ggatcgtcag aaaageccta ctttatggat caagaatttc tgagtttcgt 1980 
aaaaaggagc aacaaattaa acaatgeaag ggccacagca gggacaagca ctccattaag 2040 
gaatgaggag gggaacagaa ctcccaggcc attgtgcaga tagataccca tttaggtatc 2100 
tgtacctggg aaaacatttc ettgetaaga ggecatttae agaatagaag atgagtatcc 2160 
actagagata agtttgtgaa tttttttttc tcaaaagacc taataaaacc ctattcatca 2220 
ctcattgtac tttgtcatta ttgttcgaac caggataaca atactggaac tatattagtt 2280 
tacctttttt tgtacaaatt aggacagaaa aactcttcta aaaccatgtt tatatgeate 2340 
aacttacaaa gtacactatg taagaactga ggtaag 2376 

<210> 107 
<211> 821 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:451682 . 1 : 2000FKB18 
<400> 107 

cccatctcgc cgccgccgcg atgtctcgcc gctatgacag tcgcacgacg atcttctcgc 60 

eggaggggeg tctctaccag gtggagtacg cgatggaggc gateggcaac geegggtcag 120 

ccctcgggat cctggcggcc gaeggegtag tectggtegg cgagaagaag gtgacctcca 180 

agctcctcca gacctcccgc tccgccgaga agatgtacaa gatcgactcc cacctggcgt 240 

gcgctgtggc agggatcatg tccgacgcca acatcctcat caacaccgcc cgcctccacg 300 

cccagcgcta cgcgctctcc taccaggagc ccatccccgt cgagcagctc gtccagtccc 3 60 

tctgcgacac caagcagggt tacacccagt ttggtggcct ccgccccttt ggggtctcct 420 

tcctattcgc cggttgggac aagcaccacg gtttccagct ctacatgagc gacccatctg 480 

geaactaegg cgggtggaag gccgccgctg ttggggccaa cagccaggcc gcgcagtcca 540 

tgetcaagea ggactacaag gaegecttga cccgcgagga ggccgtcggt ctcgcgctca 600 

aggtcctcag caagaccatg gactccacca gettgactge cgagaagctg gagctggccg 660 

aggtatttct gcagcccgac aceggggagg tacagtacca ggtgtgctct cctgaggcgc 720 

tggggaagct gettgeaaac tccggcctga cacagccaac acccgaggca taatgtactg 780 

gtttgcagtg gcatgtgatg ccttctactg ctcttagaca t 821 

<210> 108 
<211> 435 
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<212> DNA v 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1077283 . 1 :2000FEB18 
<400> 108 

cgctgtctct tttcgccgct tgctgcagac atggtcaacc ccaccgtgtt cttcgacatc 60 
acggctgatg gcgagccttt gggtcgcggt ctggcttcgg aagtggttgg cagacaaagt 120 
tccaaagaca gcagaaaact ttcgtgctct gagcactggg gagaaaggat ttggctataa 180 
cgggttcctg ctttcacaga attattccag gattcatgtg ccagggtggt gacttcacac 240 
gccataatgg cactggtggc aagtccatct acggagcaga aatttgcagg atgagaactt 300 
catcctgaag catacaggtc ctggcatctt gtccatggca aatgctggac caaacacaaa 360 
tggttcccag ttttttatct gcactgccaa gactgagtgg ctggatggca agcatgtggt 420 
ctttgggaag gtgaa 435 

<210> 109 

<211> 143 0 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

^223> Incyte ID NO: LG:481436 . 5 :2000FEB18 
<400> 109 

ggcagatttt ctaaacatag aaagggagtt cacatgctac atggtcaaaa ataaaactga 60 
tacatgtcca tttcagtatt caataaatat ttgtcagatg agtgaatgag agaatcagag 120 
tcatgatttc ctgtctctag acctttaagt caccaaacat tgaagtgata atggtgttgc 180 
taatgtaatt gaaaataaaa tcgaaggagc aagctatgga ggggaacagg gatgaagctg 240 
agaaatgtgt cgacatcgtc caggaggctc tgaatgctgg caaccgcgag aaggcccagc 300 
gtttcctgca gaaggccgag aagctctacc cactgccctt ggcctgcgca taaacaaata 360 
tcaaaattac tatgaaagta cttggagtta cgaaagatag tggtgatgaa gatttgaaaa 420 
aagcttatag aaagcttgct ttgaagtttc atccagacaa aaaccatgca cctggagcaa 480 
cagatgcttt taaaaagatt ggaaatgctt atgctgtttt aagtaatcca gaaaagcgaa 540 
aacagtatga cctcacgggc aatgaagaac aagcatgtaa ccaccaaaac aatggcagat 600 
ttaatttcca tagaggttgt gaagctgata taactccaga agacttgttt aatatatttt 660 
ttgggggtgg atttccttca ggtagtgtac attctttttc aaatggaaga gctggttata 720 
gccaacaaca tcagcatcga catagtggac atgaaagaga agaggaaaga ggagatggag 780 
gtttttctgt gtttatccag ctgatgccca taattgtatt gatcctcgtg tcattattaa 840 
gccagttgat ggtctctaat cctccttatt ccttatatcc cagatctgga actgggcaaa 900 
ctattaaaat gcaaacagaa aacttgggtg ttgtttatta tgtcaacaag gacttcaaaa 960 
atgaatataa aggaatgtta ttacaaaagg tagaaaagag tgtggaggaa gattatgtga 1020 
ctaatattcg aaataactgc tggaaagaaa gacaacaaaa aacagatatg cagtatgcag 1080 
caaaagtata ccgtgatgat cgactccgaa ggaaggcaga tgccttgagc atggacaact 1140 
gtaaagaatt agagcggctt accagtcttt ataaaggagg atgaactgga atttttattt 1200 
atacctttta gcgtactctt tattttttct gtaagtaagt ttggtttcat catgagggat 1260 
gaaggaaaag atttgatact gaaaactaaa ctgaatagtt ggttcctgaa atcttggact 1320 
gtttatgacc tactggctcc tttaaatagt aactgaaaac taaaatggaa tattttagtt 1380 
aacgcttcta caagtatttt cattttaaaa gcttacatga ttcctaacta 1430 

<210> 110 
<211> 889 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 793701. 1 :2000FEB01 
<220> 

<221> unsure 

<222> 730, 733-735, 797-798 
<223> a, t, c, g, or other 

<400> 110 
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ccgacagcat aatgtgaggc aaggtctagc tgttcccccc cggcatacaa gcttatagag 60 
cagcctcctt tgaagatctc caggtggact tcacagagat gccaaagtgt ggaggtgttc 120 
gagtgtggat caaggactgg aacgtagcct ctttgtgccc atggtggaaa ggaccccaga 180 
ctgtcgtcct gatcactccc actgctgtga acgtagagag aatcctagcc tggatccatc 240 
acaaccgtgt aaaacctgca gcgcctgaat cctgggaggc aagaccaagt ctggacaacc 300 
cctgcagagt gaccctgaag aagatgacaa gccctgctcc agtcacaccc agaagctgac 360 
tggtccacgc acagccgaag catgaggaag ctcattgtga gattcatttt tcttaaattt 420 
tggacttata cagtaagggc ttcaactgat cttactcaaa ctggggactg ttcccagtgt 480 
attcatcagg tcaccgaagt aggacagcaa attaaaacaa tctttctgtt ctatagttat 540 
tatgaatgtg tggaaacaat aaaaaaactt gtttgtataa tgccactcag tacaaggtat 600 
gtagcccgag aaatgaccga cctgatgcgt gttataaccc atctgagccc tccgcaacca 660 
ccgtttttga aataagaata agaactggcc ttttcctagg tgatacaagt aaaataataa 720 
ctagaacagn cannnaagaa attcccaagc aaataacttt aagatttgat gcttgtgcag 780 
ccattaatag taaaaannta gaaataggat gtggttctct taactgagaa aggagctaaa 840 
gagtagaaaa taaatatgtt tgtcatgagt caggggtttg taaaaattg 889 

<210> 111 

<211> 1106 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: hi :373637 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 974 

<223> a, t, c, g, or other 
<400> 111 

gccatccagc atggtggact actacgaggt gctggacgtg ccccggcagg cctcatccga 60 
ggccatcaag aaggcgtacc gcaagctggc gctcaagtgg caccccgaca aaaaccctga 120 
gaacaaggag gaagcggaga ggagattcaa gcaggtggcc gaggcctacg aggtgttgtc 180 
ggacgccaag aaacgcgata tctatgaccg ctatggcgag gcgggggcgg agggcggctg 240 
cacaggcggc aggcccttcg aggacccctt cgagtacgtc ttcagcttcc gcgacccagc 3 00 
cgacgtcttc agggagttct tcggcggcca ggacccattc tcctttgacc tcttgggaaa 3 60 
cccgctggag aatattttgg gggggtcaga ggaactgctg gggaagcaga agcagagcgt 420 
ctgcacccct tttctctgcc ttcagtgaat ttccagcttt tgggggtgtt ttttcttctt 480 
ttgatacagg atttcgttcc tttggctccc tgggaagtgg gggcctttct tccttctgca 540 
tgtcctacgg tagtgatggg acaggcagct tcaagtccat gtcgacttcc actgaaatag 600 
ttgatggtaa aaaaatcacc accaagagaa tcattgagaa tggccaagaa agggtggaag 660 
tggaggaaga tggagagttg agttaaagtc cttcataata aatagcaaag agcagttact 720 
ccgcattgac accaagtaat tccagtccac atttgacttt aagcacatct ggaggaatag 780 
cggacttttt ttaggattga aagtgaactt tactttcaga acagctgtac ccaagaattt 840 
ataaacactt catgccaaca cctgttctta tcgttgggac tacacggata ggacctctgt 900 » 
ttgtctttca atcgttgtaa aatatctgta tgcaatttgc tatttattaa acttaagcct 960 
gaggcagaca atgnttattt gctagtggta ggacagaatg ttcagttcag cactttgagc 1020 
attacctttt tccgttggga aagtgtgagc caggtggttt gctctaaatc atttgacttt 1080 
attgtttgtg tgtagtggtg aaatct 1106 

<210> 112 
<211> 461 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 239368 . 2 :2000MAY19 
<400> 112 

tcgactccga cctcgtgatc tgccccctcg gcctcccaat gctggatacg gcatgagcac 60 
acgcccagcc cctgcaagca gtttctttaa ctctgttggg atgccattcg tgggcaaagc 120 
tagatttggg acaagttgcc cagctctgct aggaagagtc agtcctcagg gggaagtttc 180 
ttttcacctc cgggatcccc agggcttcct gggttgactc aggacttcat acgcagcctg 240 
gatctccagg aagtgcctct gtgcctcctc tgtctggtcc aggttgtggt ctgggtgcca 300 
gaccttcact agctcctggt aactccgatg tatttcttca tttgttgccc cttctgagag 360 
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gcccaaaacc tggtaagcca gctgacgctt ctcatcctga aaactgtgaa caaactcata 420 
gagcttcgcc cactcctgaa agcagctgct gttgaagcca g 461 

<210> 113 

<211> 1450 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 053826 . 1:2000MAY01 
<400> 113 

gaattccccc cccccccccc tcacttggtg tgtctatatg tctggcagac attatcagca 60 
cattctctgt tgttacctgt gattcatttt ttcttcactc tccaggtgaa tttcaattgc 120 
tgaaaatttc ccactgaaaa tatgcagtaa tatattttgt ggttcagaca tttggggcaa 180 
atggttcaca ttcattttag ggttagtggt catgctgttt atttttctct gctatacaaa 240 
gttcctctta ggggtctgcc tcatgacact aaaaaatgaa tagagattct actgtaggtt 300 
atctcctagg cttgagttca acatttgttt ggatttttga agaaagtcaa atcaagcaat 3 60 
gctcccaaat gatgtctttg taaattcata ccctctggcc ctattttttt tcatagaccc 420 
taactctacc tttctgcttt aaagcaaagt aaactcggtg gcctcttctt ctccacccct 480 
caaaatgata gcaatctctg ccgtcagcag tgcactcctg ttctcccttc tctgtgaagc 540 
aagtaccgtc gtcctactca attccactga ctcatccccg ccaaccaata atttcactga 600 
tattgaagca gctctgaaag cacaattaga ttcagcggat atccccaaag ccaggcggaa 660 
gcgctacatt tcgcagaatg acatgatcgc cattcttgat tatcataatc cagttccggg 720 
ccaaggggtc ccccccggag gcaatattgc atattttggt tggggttgaa atcctgtgcc 780 
aatctcggca aaggcttggg cggctacttg catttgggac catggacctt cttacttact 840 
gagatttttg ggccaaaatc tatctgtacg cactggaagg taggaaatat cgctctattc 900 
tccagttggt caagccatgg tatgatgaag tgaaagatta tgcttttcca tatccccagg 960 
attgcaaccc cagatgtcct atgagatgtt ttggtcccat gtgcacacat tatacgcaga 1020 
tggtttgggc cacttccaat cggataggat gcgcaattca tacttgccaa aacatgaatg 1080 
tttggggatc tgtgtggcga cgtgcagttt acttggtatg caactatgcc ccaaagggca 1140 
attggattgg agaagcacca tataaagtag gggtaccatg ttcatcttgt cctccaagtt 1200 
atgggggatc ttgtactgac aatctgtgtt ttccaggagt tacgtcaaac tacctgtact 1260 
ggtttaaata agtttacctt ttcctccagg aaatataatg atttctggga acatgggcat 1320 
gtatatatat atatggagag agaattttgc acatattata catattttgt gctaatcttg 1380 
ttttcctctt agtattcctt tgtataaatt agtgtttgtc tagcatgttt gtttaatcct 1440 
ttgggaattc 1450 

<210> 114 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 44 93 93 . 1:2000MAY01 
<400> 114 

cagccatgtc gctccgcgtg ctgaacccta acgccgaggt gctcaacaag tcggcagcgc 60 
tgcacatgaa catcaacgcc gccaagggcc tccaggacgt gctcaagacc aacctcggcc 120 
ccaagggcac catcaagatg cttgtgggtg gagctggtga cctgaagctg acgaaggatg 180 
ggaacaccct cttgaaggaa atgcaaattc agaacccaac tgcaatcatg attgcgagga 240 
cggctgtggc acaggatgac acaagtggtg atggcacaac ctccactgtg ctatttatcg 300 
gggagctaat gaagcagtct gagcggtgca tcgatgaagg aactcatcca agattcttgg 360 
ttgatggttt tgatgttgct aagagagcgt gtcttgattt tctcgataag ttcaagacgc 420 
cagtcgttac tggtgaagaa cctgacagag ataccttaaa aatggtagca agaacaacac 480 
ttaggacaaa gttgtatgaa gggttggctg atcagttgac tgatattgtt gtgaatgcag 540 
ttctgtgcat tcgcaaacca gatgaaccaa ttgatctatt tatggtggaa ataatgcaca 600 
tgcgccacaa atttgatgtt gacacccgtc tggttgaggg tctggtcctg gaccacggtt 660 
ctcgtcaccc tgacatgaag cgcagggcag agaattgtta catcttgacc tgcaatgtct 720 
cattagaata tgagaaaagt gaaatcaatg caggattttt ttactcaaac gcagaacaaa 780 
aaaaaaaaaa agg 793 

<210> 115 
<211> 2494 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1071427 . 96 :2000MAY01 
<400> 115 

atgaaagtca ggcttttatt gagatgagac aagagaagat gcaatggcaa tggttgacca 60 
ttgtttgaaa aaagcccttt ggtttcaggg gagatgtgtg aaggttgacc tgtctgagaa 120 
atataaaaaa ctggttctga ggattccaaa cagaggcatt gatttactga aaaaagataa 180 
atcccgaaaa agatcttact ctccagatgg caaagaatct ccaagtgata agaaatccaa 240 
aactgatggt tcccagaaga ctgagagttc aaccgaaggt aaagaacaag aagagaagtc 3 00 
cggtgaagat ggtgagaaag acacaaagga tgaccagaca gagcaggaac ctaatatgct 3 60 
tcttgaatct gaagatgagc tacttgtaga tgaagaagaa gcagcagcac tgctagaaag 420 
tggcagttca gtgggagacg agaccgatct tgctaattta ggtgatgtgg cttctgatgg 480 
gaaaaaggaa ccatcagata aagctgtgaa aaaagatgga agtgcttcag cagcagcaaa 540 
gaaaaagctt aaaaagcgtc gttttccagg gagtatggaa ggttttgtca ctctagatga 600 
ggttggtgat gaggaagatt cggaacttca gaaacttcgt aaatcgggca tggcatttaa 660 
atctggtgac aaaaatgatg atggtttggt tgaaattaag gtggacaaga tcgaggaact 726 
tgatcaagaa aacgaagcag cgttggaaaa tggaattaaa aatgaggaaa acacagaacc 780 
aggtgctgaa tcttctgaga acgctgatga tcccaacaaa gatacaagtg aaaacgcaga 840 
tggtcaaagt gatgagaaca aggacgacta tacaatccca gatgagtata gaattggacc 900 
atatcagccc aatgttcctg ttggtataga ctatgtgata cctaaaacag ggttttactg 960 
taagctgtgt tcactctttt atacaaatga agaagttgca aagaatactc attgcagc.ag 1020 
ccttcctcat tatcagaaat taaagaaatt tctgaataaa ttggcagaag aacgcagaca 1080 
gaagaaggaa acttaagatg tgcaaggaga tttaatgatt tcaaagaaaa tgatggttct 1140 
ttgtttttaa tgttaacctt ttttaaatac aatactgata gaagaaaact attgtactct 1200 
tttgttttag tggagaaata atagatgtct gttcatgtgt taagtgttat agcaaaaaaa 1260 
atacacatat ggttaagtta atgaatagtt tttgttttat cagaatggca acagacagaa 1320 
gtactttgta gagattgact tcctaagcta cttaagacaa cttgcaccac taagaaaaaa 1380 
atgtagaacc atttggaaaa atgaaattta gtagttccaa gtttcaaaga aatgtcaaca 1440 
ttttattcca ttcaataaag aacaaaacca atagtgtttt tattactttc atctgaaaca 1500 
ttccatgttt taatctgagc cttgcagact ttcatttgga gtttgaaccc gttttggttg 1560 
catttcattt ttggagaact taattaacgt gagattggca attgaaatgc aggtgcagtt 1620 
ttctgttaat gtcatgctgt tgtttaggta ataagaaata ttaagtaatt ggctttagat 1680 
tttgtaattt ttttccctga gttcctgcta gatttcgtat tctagtagtc aatgtatttt 17'40 
cagtgaaatg caaaaatatt cccattatct ttgaccagta ttaatttttg agatcttact 1800 
gcttgtcact tgaatcccgt gattgtcata catctctggt ataagcaaca tttgattttt 1860 
gaagtgtgta gaccatctct tcatattttc aagatgtaat tttacatttc tgcattttta 1920 
aaacagtttg gccataatcc tagatgcacg cttctaattc atgtacctgc acatgtgacc 1980 
tttgtgaaca gaaatttgca tgtataattt gtgtttactt gtaactttct ggttatatac 2040 
tgcttatatc tgtggattca agttactgaa gtgaatacca ataaaaagaa aaccctaggc 2100 
catgttaatt ggttatacat gtttggaatg ttaaccaacg tatttgtcag ttgtggtttt 2160 
tattcgctct taaactttgt gcatgcttta acaatttatt acttttaaat ctagagtgaa 2220 
ttctaaagac tgccgctaaa gatctgagtt ttaaaaatgt tgttgctggt ggatttcttg 2280 
ttcctgttac ataactaaaa gtgaggccat ttgtggtttt taaaaacctt atgaattaaa 2340 
aatgctacag gtgaacaaac agaagcttat gtttagagag attgatgact taacagtaca 2400 
attggaagta atacggatga gcaagaatta gttctgcagc ttttcaaaat aattacgtag 2460 
agactcttgg tatattggat tatctgttgt aaac 2494 

<210> 116 
<211> 957 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID^No: LI:33 6338.8:2000MAY01 
<220> 

<221> unsure 

<222> 684, 708, 743 

<223> a, t, c, g, or other 

<400> 116 

gcatcttata atatgtggca gtttacaact gctttgacca gaaaactgat gtgatatcag 60 
tgttactgcc cagatatgag agttggatgc agagacatct gttcaaccga gttaggcaca 120 
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tgactacttc ttcttgtgtt gaatttaatt cttcttcttg tgttgaattt aattgccgtt 180 
ctaaagaggt ttgaaaatta aactattgtt caatattcga aggaaagatt cttgcataag 240 
catggaaaca aacaaaacag aatgttacat aagtataaca aatactttta ataagtctaa 300 
aattaaaatt ctaattgata agacaatatt gtgtcacagt ttaatatgaa gcatcttttc 360 
ctttgtgtta cttaaaatga tggtgcatct taaatcttgt gtcttagatt caatgaaata 420 
tggtataaat gactgtatac tctctgctct aggtatctca aagaagacaa atttcttgct 480 
tgtgttatta attgatttct aatttattct ttttctaatc tttttttttt tttcacaaat 540 
ggataccaat tacatttacc atgggactgc aatgtctctc ctctccttga aaaacaatgc 600 
ggttgactga aatcacttgg ggtggaccaa atttcctcct gttgactact cctgaatgat 660 
ctcttccaat tccccaaatc tgcntctctg gccaatcaca accttctncc atgtgtgtac 720 
atcatgct!gc tctcgtcttc aantcccctt ctctcttgct gacttctgga aaagcaacgg 780 
aagagtcctg ggcggaagga ggcttctgta tgcttgtgaa aaagaacaat ctgtgccaac 840 
ggaaggttct tcaacaactt tgctgcaaaa catgtacatt tcaaggctga gcagccatct 900 
tagatttctt tgttcctgta gacttataga ttattccata ttattaaaaa gaaaaaa 957 

<210> 117 

<211> 1269 

<212> DNA 

<213> Homo sapiens 

<220> .i 
<221> mis cofeature 

<223> Incyte ID No: LG: 345527 . 1 :2000FEB18 
<400> 117 

cagggagtta aaacgattca atgctgataa caaacttctt ttgactggta ctcccttgca 60 
aaacaattta tcagaacttt ggtcattgct aaactttttg ttgccagatg tatttgatga 120 
cttgaaaagc tttgagtctt ggtttgacat cactagtctt tctgaaactg ctgaagatat 180 
tattgctaaa gaaagagaac agaatgtatt gcatatgctg caccagattt taacaccttt 240 
cttattgaga agactgaagt ctgatgttgc tcttgaagtt cctcctaaac gagaagtagt 300 
cgtttatgct ccactttcaa agaagcagga gatcttttat acagccattg tgaaccgtac 360 
aattgcaaac atgtttggat ccagtgagaa agaaacaatt gagttaagtc ctactggtcg 420 
accaaaacga cgaactagaa aatcaataaa ttacagcaaa atagatgatt tccctaatga 480 
attggaaaaa ctgatcagtc aaatacagcc agaggtggac cgagaaagag ctgttgtgga 540 
agtgaatatc cctgtagaat ctgaagttaa tctgaagctg cagaatataa tgatgctact 600 
tcgtaaatgt tgtaatcatc catatttgat tgaatatcct atagaccctg ttacacaaga 660 
atttaagatc gatgaagaat tggtaacaaa ttctgggaag ttcttgattt tggatcgaat 720 
gctgccagaa ctaaaaaaaa gaggtcacaa ggtgctgctt ttttcacaaa tgacaagcat 780 
gttggacatt ttgatggatt actgccatct cagagatttc aacttcagca ggctgatggg 840 
tccatgtctt actcagagag agaaaaaaac atgcacagct tcaacacgga tccagaggtg 900 
tttatcttct tagtgagtac acgagctggt ggcctgggca ttaatctgac tgcagcagat 960 
acagttatca tttatgatag tgattggaac ccccagtcgg atcttcaggc ccaggataga 1020 
tgtcatagaa ttggtcagac aaagccagtt gttgtttatc gccttgttac agcaaatact 1080 
atcgatcaga aaattgtgga aagagcagct gctaaaagga aactggaaaa gttgatcatc 1140 
cataaaaatc atttcaaagg tggtcagtct ggattaaatc tgtctaagaa tttcttagat 1200 
cctaaggaat taatggatta ttaaaatcta gagattatga aagggaaata aaaggatcag 1260 
agagaggtc 1269 

<210> 118 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1089383 . 1 :2000FEB18 
<400> 118 

cctgtagcca ctgaggttca caggctctgt ttgaggtgtt gtgttagatt gtgctgggtt 60 
ggttttgctg cttattgagt gatgaagagc aacatattag ctactgagat attttgcaga 120 
aaaaggaaga gatgtcttca gctcaaatgc tatcaaactg gtgaaataaa cagcgttgtt 180 
gctcacacaa agcctgtttg gtagtctctt cacatggacg cgtgtgacat ttggtgctga 240 
agacccggga caggaggact ccttcaggag acgggtcccc tgtccttgcc ctcactccgt 300 
gaggagatcc acctacgacc tcaggtcctc agaccaacca gcccaaggaa catctcatga 360 
atttcaaatc ggattcccaa ctatatgaag acaccctagc tggacgatca gttcttatta 420 
agaacctgac ccctcaaact ctacaacctc gatggactgg accctactta gtcatctata 480 
gtaccccaac tgccgtccgc ctgcaggatc ct 512 
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<210> 119 
<211> 530 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1092522 .1:2000FEB18 
<400> 119 

ggctaatgac ttattacttg ctgtttatat ttattttgga ctatggaaat gatgctagac 60 
aaaaagcaaa gttgagtgat tttcttattt aagttcaaag tgggtcgtaa aacagcagag 120 
acaacttgca atatcaacaa tgcatttggc ccaggaactg ctaatgaacg tacagtgcag 180 
tggtggttca agacactttt caaaggagat gagaaccttg aagatgagga gcatagtggc 240 
caaaccactg gaagttgaca atgaccaact gagagcaatc agcaaagctg atcctcttaa 300 
ggctacttga gatgttgccg aagaactcaa tgtggaccat tctacagtca tttggcattt 360 
gaagcaaatt ggaagagtga aaaagctcga taagtgggtg cctcatgagc tgagtgaaaa 420 
tttaaaaaat agtcattttg aagtgtcctt atcccttatt ttttgcaaca acaaaccatt 480 
tcttggtcag attgtgatgt gcgagaaaaa gtggatttta tacaacaact 530 

<210> 120 
<211> 671 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: IjG:1093216 . 1 :2000FEB18 
<220> 

<221> unsure 

<222> 436, 454, 499, 554 
<223> a, t, c, g, or other 

<400> 120 

aactgttttc ctttgaggaa tattgatgat cccactgctt gagtaatgga aaccagcttg 60 

cagctgatag ttattttttg tcttttttcc tgttctatcc actgtcctag gtttggaatt 120 

tcagctgggc agcttcgatc ccaccacctg tagccactga ggttcacagg ctctgtttga 180 

ggtgttgtgt tagattgtgc tgggttggtt ttgctgctta ttgagtgatg aagagcaaca 240 

tattagctac tgagatattt tgcagaaaaa ggaagagatg tcttcagctc aaatgctatc 300 

aaacttgtga aataaacagc gttgttgctc acacaaagcc tgtttggtag tctcttcaca 3 60 

tggacgcgtg tgacatttgg tgctgaagac ccgggacagg aggactcctt caggagacgg 420 

gtcccctgtc cttgcnctca ctccgtgagg aganccacct acgacctcgg gtcctcagac 480 

caaccagccc aaggaacanc tcatgaattt caaatcggat tcccaactat atgaagacac 540 

cctagctgga cgancagttc ttattaagaa cctgacccct caaactctac aacctcgatg 600 

gactggaccc tacttagtca tctatagtac cccaactgcc gtccgcctgc aggatcctcc 660 
ccactgggtt c 671 

<210> 121 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:270318 .3 :2000FEB01 
<400> 121 

ttctttgaga ctatttaaaa actggtacaa caggtctcta caacgccaag atctaactaa 60 
gctttaaaag ggatagctgg gagtatggcc accctgctcc acgatgcggt aatgaatcca 120 
gcagaagtgg tgaagcagcg cttgcagatg tacaactcgc agcaccggtc agcaatcagc 180 
tgcatccgga cggtgtggag gaccgagggg ttgggggcct tctacggagc tacaccacgc 240 
agctgaccat gaacatc 257 

<210> 122 
<211> 1353 
<212> DNA s 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 335671 . 2 :2000FEB01 
<400> 122 

gggaagtgaa tatccctgta gaatctgaag ttaatctgaa gc tgcagaat ataatgatgc 60 

tacttcgtaa atgttgtaat catccatatt tgattgaata tcctatagac cctgttacac 120 

aagaatttaa gatcgatgaa gaattggtaa caaattctgg gaagttcttg attttggatc 180 

gaatgctgcc agaactaaaa aaaagaggtc acaaggtgct gcttttttca caaatgacaa 240 

gcatgttgga cattttgatg gattactgcc atctcagaga tttcaacttc agcaggcttg 300 

atgggtccat gtcttactca gagagagaaa aaaacatgca cagcttcaac acggatccag 360 

aggtgtttat ctcttagtga gtacacgagc tggtggcctg ggcattaatc tgactgcagc 420 

agatacagtt atcatttatg atagtgattg gaacccccag tcggatcttc aggcccagga 480 

tagatgtcat agaattggtc agacaaagcc agttgttgtt tatcgccttg ttacagcaaa 540 

tactatcgat cagaaaattg tggaaagagc agctgctaaa aggaaactgg aaaagttgat 600 

catccataaa aatcatttca aaggtggtca gtctggatta aatctgtcta agaatttctt 660 

agatcctaag gaattaatgg aattattaaa atctagagat tatgaaaggg aaataaaagg 720 

atcaagagag aaggtcatta gtgataaaga tctagagttg ttgttagatc gaagtgatct 780 

tattgatcaa atgaatgctt caggaccaat taaagagaag atggggatat tcaagatatt 840 

agaaaattct gaagattcca gtcctgaatg tttgttttaa agtggagctc aagaatagct 900 

tttaaaagtt cttatttaca tctagtgatt tccctgtatt gggtttgaaa tactgattgt 960 

ccacttcacc ttttttatca tatcagttga catgtaacta gtaccatgcg tacttaaata 1020 

gatggtaatt ctctgagcct taccaagaac aaagaagtat ccatattaag tttagatttt 1080 

cagttaattt ttgagactga gtagtattct tggatacagg ctgatgtgta cttaaccact 1140 

tccagattta tacagtcttc qtgtggaagt ttagtaaatg tctttttccc tcctttcttc 1200 

tagtaatgca gttcatgggc tttaggtact tcagttatga agtaggcttt tcatggggag 1260 

agattgggat tatgctctct gttgtttaag aaactgtttg attttagagt ctatttctat 1320 

gagatagttt accaaataaa tgttccttat aag 1353 

<210> 123 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 793758 . 1: 2000FEB01 
<220> 

<221> unsure 
<222> 554 

<223> a, t, c, g, or other 
<400> 123 

aactgttttc ctttgaggaa tattgatgat cccactgctt gagtaatgga aaccagcttg 60 
cage tga tag ttattttttg tcttttttcc tgttctatcc actgtcctag gtttggaatt 120 
tcagctgggc agcttcgatc ccaccacctg tagecactga ggttcacagg ctctgtttga 180 
ggtgttgtgt tagattgtgc tgggttggtt ttgetgetta ttgagtgatg aagagcaaca 240 
tattagctac tgagatattt tgcagaaaaa ggaagagatg tcttcagctc aaatgetate 300 
aaacttgtga aataaacagc gttgttgctc acacaaagcc tgtttggtag tctcttcaca 360 
tggacgcgtg tgacatttgg tgetgaagae cegggacagg aggactcctt caggagaegg 420 
gtcccctgtc cttgccctca etcegtgagg agatccacct acgacctcgg gtcctcagac 480 
caaccagccc aaggaacatc tcatgaattt taaateggat tcccaactat atgaagacac 540 
cctagctgga cgancagttc ttattaagaa cctgacccct caaactctac aacctcgatg 600 
gactggaccc tacttagtca tctatagtac cccaactgcc gtccgcctgc aggatcctcc 660 
ccactgggtt c 671 

<210> 124 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 803718 . 1 : 2000FEB01 
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<220> 

<221> unsure 
<222> 290 

<223> a, t, c, g, or other 
<400> 124 

cctgtagcca ctgaggttca caggctctgt ttgaggtgtt gtgttagatt gtgctgggtt 60 
ggttttgctg cttattgagt gatgaagagc aacatattag ctactgagat attttgcaga 120 
aaaaggaaga gatgtcttca gctcaaatgc tatcaaactg gtgaaataaa cagcgttgtt 180 
gctcacacaa agcctgtttg gtagtctctt cacatggacg cgtgtgacat ttggtgctga 240 
agacccggga caggaggact ccttcaggag acgggtcccc tgtccttgcn ctcactccgt 300 
gaggagatcc acctacgacc tcaggtcctc agaccaacca gcccaaggaa catctcatga 360 
atttcaaatc ggattcccaa ctatatgaag acaccctagc tggacgatca gttcttatta 420 
agaacctgac ccctcaaact ctacaacctc gatggactgg accctactta gtcatctata 480 
gtaccccaac tgccgtccgc ctgcaggatc ct 512 

<210> 125 
<211> 782 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:412179 . 1 :2000FEB01 
<400> 125 

aggcagcatg acaacatgat ggacagagca cggattttct gagggataca gccaggtccc 60 
acccctaatt agcttcttga gctcagctaa gttcttaaaa cttctctgag cccttgattt 120 
tctccatctg catgatggat tattttatag gttgaccact cgtcctggtt tgcctgagac 180 
tgaagggcat cctgggacac agaacttcca gtgctaaaac ccagaaaatc ccaggcatat 240 
caagatgagt tggtcactta gaggacagat gaattgaaat gacatataga aatggactca 300 
cactgtgccc tgcatctaat agctgctcta taaatgtagt cacacaaaaa cgtgctcctc 360 
acatggtaaa agaaaaataa aatgaaacaa ataggtgagt gcaaaataat tgcagttttt 420 
gaattgttgg aatttgctgt ataatactgg aatacattct taaataaatg tggttatgtt 480 
atacatcatt ttaatggaca tttctcgctt tatgtttgct aatgacttat cacttgctgt 540 
ttattttatg tgtatttaaa ctatagaaat gatgttagac ataaagcaaa ttcaagtgat 600 
tttcttattc gagttcaaaa tgggtcgtaa aatagcagag acaactcgta acatcgacaa 660 
tgcatttggc ccaggactgc taacgaatgt acagtgcagt ggcagttcaa ggagacaagg 720 
agcgtagtgg tgggccattg gaagttgaca atgacaaact aagagcaatc attgaagctg 780 
at 782 

<210> 126 

<211> 1937 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 815679 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 635 

<223> a, t, c, g, or other 
<400> 126 

ccaggtctac atgtataaaa ttttttaaca gtaacaaaat actttgcttt ctttttatat 60 
taagtcttca caatctaatg tgtatttgac acttctcgca catttcagaa tgatggcagc 120 
agcccatatg gggggcccgc ccatgatgcc aaagatgggc cctcctcctc ctgggatgat 180 
gccagtggga cctgctcttg gaatgaggcc gcccatggga ggccacatgc ctgtgatgcc 240 
tgggtgccca atgataagac ctcctgcccg tctcatgatg gtgcccagtc agcccagaat 300 
gacttgacca gacagataag gatagagggg aggcctcatt gcatcagtgt tgttttgttg 360 
ttgttattgt tgtgtttttt ttgtttgtaa tgttttgttt tgtttttgag acagagtctt 420 
cctctgtcgc ctaggctgga gggcagtggc gtgatctcgg ctcactggaa cctccgcctc 480 
ccgggttcaa gcgattcccc tgcctcggcc tcctgagtag tgtgggactg caggcgtatt 540 
caccatgccc cggctaattt tttttattta gtggagacgg ggtttcaccg tgttggccag 600 
atggtctcga tctcctgacc tcgtgactcg ctcgntcagc ctctgaaggt gctgggattg 660 
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caggtgtgaa gccactgcgc ctggcctata tgaattttat atttacctgc tcccttcacc 720 
aggagatcat gctgctgtga tatcgagttt tcttaacagc ataaggaaga cttgccctct 780 
tgccctatca aagagaatag ttttggaggg gagaagtggg accaaaaaag atgcagtttt 840 
catttgtatt gggaaatgtg aaaataaaat tgccaactgc tttagttaaa aacaaaaaaa 900 
aagaaaagga aacaagatgt ggggctgcca tatgcaatac cgtggattct acagatcttc 960 
tactctggag gcaaatatta tcttcgctga agccagacca acctgacaca aaggcctttt 1020 
gtttttttaa tgtgactgtg ttttatttta gaatgtgtaa ttcactttag aagggcaaag 1080 
tacctgtctg gggaagacta tttaatttcc tgcatttatt tagaatgctg gctgatgtta 1140 
ttatgaaggg aaacagctct aacaactgag tgtcccccac atagacacag ctcatgagtt 1200 
cacggggcaa aggaattgaa cagcagcctc ctaatagccg gccttctttg tgatgtggaa 1260 
ataattatca gcatgtaaaa gactatatat atattcaaca attctgaccc cctgcaaaat 1320 
tcaaatctac aactgatttg cttcctgggc tcctgaaaac aactttgtca aaattgttca 1380 
gaaatataat cagccaatcg ttgccccttg gggacgcagg ataaagcaag tcatccatga 1440 
ccaatgtgga gtcggccgtg cacaatgaca tgcagacttg caggacatcg agtccctgct 1500 
atggtccctc cccagtcagg cccccattgc ctgggctgca gccagcagca ttcaggcaca 1560 
agtgcattca acaaatactt atttaattgt attggtggtt agagggttgt gattgattaa 1620 
ggtacattaa tggatccatg tcctccctgt atccaagact ctgccatttg tctctgcagt 1680 
tcctcccact gaagaatcgg agtatatttc tccagtccct aatgttgggt tttgaattgt 1740 
gtctagcttt ggccactgga atattaatct gtatgaccaa aaacttggaa agtgtgaatt 1800 
catttgtgct cgctcactcc tgctatcacc atgagaacaa gcctaggcca ggctgctgct 1860 
tccagcagaa gataagagac accaagaaca aagtcgagct tcccagacat gctcatgcta 1920 
gactgaccaa tcctcag 1937 

<210> 127 
<211> 504 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 481361 . 3 :2000FEB01 
<400> 127 

ggagccctgg ggatatgatg ccagccaggg aaggtgagaa tgtggtatgt gtgtgtgtgt 60 
agtgccccct gcaggagatg aagccccaga gaaacacagc ggaccttctg ccaaaactca 120 
agtccatggc cctggctgac agagctgtgt ttgaaaaggg catgaaagct tttgtgtcat 180 
atgtccaagc ttatgcaaag catgaatgca acctgatttt cagattaaag gatcttgatt 240 
ttgccagcct tgctcgaggt tttgccctgc tgaggatgcc caagatgcca gaattgagag 300 
ggaagcagtt tccagatttt gtgcccgtgg acgttaatac cgacacgatt ccatttaaag 3 60 
ataaaatcag agaaaagcag aggcagaaac tcctggagca acaaagaaga gagaaaacag 420 
aaaatgaagg gagaagaaaa ttcataaaaa ataaagcttg gtcaaagcag aaggccaaaa 480 
aagaaaagaa gaaaaaaatg aatg 504 

<210> 128 
<211> 971 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 2473 88 . 1 :2000MAY19 
<220> 

<221> unsure 
<222> 46 

<223> a, t, c, g, or other 
<400> 128 

aacaggaaac aattattgaa gtagagctga ctaatatcca gaaganatac tatcgggcta 60 
ttttggagaa gaatttctcc ttcctttcca aaggggcagg tcataccaac atgcctaatc 120 
tacttaacac aatgatggag ttgcgcaagt gctgcaacca cccatatctc atcaatggtg 180 
ctgaagaaaa aatcctaaca gaattccgtg aagcttgcca tattatacct catgaccttt 240 
cacctgcagg ccatggttcg ttcagccggc aaactggttc ttattgacaa gttgcttcca 300 
aagcttaaag ctggtggcca taaagttctg atcttctctc agatggtacg ctgcctagac 3 60 
atcctagagg attatttaat ccagaggagg tacttatatg aacgtattga tgggcgagtt 420 
agaggcaacc ttcgacaggc tgccattgac cgcttcagca agcctgactc agaccgcttt 480 
gtcttcttac tgtgtacccg ggctggtgga cttggtatta atcttacagc tgctgatacc 540 
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tgcatcatct ttgattcaga 
catcgaattg ggcagagcaa 
gagagagaga tgtttgataa 
tccatgagtg gtcgggatgg 
gaagatcttt taagaaaagg 
aagttttgtg aagaggacat 
gaatctgaag gaaaaggttc 
acagatattt c 

<210> 129 
<211> 192 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<223> Incyte ID No: LG: 255789 . 10 :2000MAY19 

<400> 129 

aactaaaaga attgacaatt ttattttcac atttcccaat acaaatgaaa actgcatctt 60 
ttttggtccc acttctcccc tccaaaacta ttctctttga tagggcaagg gggcaagtct 120 
tccttatgct gttaagaaaa cccagcatca cagcccatga tctcctggtg aagggagcag 180 
gtaaatataa ag 192 

<210> 130 

<211> 2002 * 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:787618. 1 :2000MAY01 
<220> 

<221> unsure 
<222> 635 

<223> a, t, c, g, or other 
<400> 130 

ccaggtctac atgtataaaa ttttttaaca gtaacaaaat actttgcttt ctttttatat 60 
taagtcttca caatctaatg tgtatttgac acttctcgca catttcagaa tgatggcagc 120 
agcccatatg ggtggcccgc ccatgatgcc aaagatgggc cctcctcctc ctgggatgat 180 
gccagtggga cctgctcttg gaatgaggcc gcccatggga ggccacatgc ctgtgatgcc 240 
tgggtgccca atgataagac ctcctgcccg tctcatgatg gtgcccagtc agcccagaat 300 
gacttgacca gacagataag gatagagggg aggcctcatt gcatcagtgt tgttttgttg 360 
ttgttattgt tgtgtttttt ttgtttgtaa tgttttgttt tgtttttgag acagagtctt 420 
cctctgtcgc ctaggctgga gggcagtggc gtgatctcgg ctcactggaa cctccgcctc 480 
ccgggttcaa gcgattcccc tgcctcggcc tcctgagtag tgtgggactg caggcgtatt 540 
caccatgccc cggctaattt tttttattta gtggagacgg ggtttcaccg tgttggccag 600 
atggtctcga tctcctgacc tcgtgactcg ctcgntcagc ctctgaaggt gctgggattg 660 
caggtgtgaa gccactgcgg cctggcctat aatgaattta taatttaacc tgctcccttc 720 
acccaggaga tcatgctgct gtgatgtcga gttttcttaa cagcataagg aagacttgcc 780 
ctcttgccct atcaaagaga atagttttgg aggggagaag tgggaccaaa aaagatgcag 840 
ttttcatttg tattgggaaa tgtgaaaata aaattgacaa ctcttttagt taaaaacaaa 900 
aaaaaagaaa aggaaacaag atgtggggct gccatatgca ataccgtgga ttctacagat 960 
cttctactct ggaggcaaat attatcttcg ctgaagccac gaccaacctg acacacaagg 1020 
ccttttggtt ttttaaatgt gactgtgttt tattttagaa tgtgtaattc actttaggaa 1080 
gggcagaagt acctggtctg ggggaagact atgttaattt cctgcatgtt atttagaatg 1140 
ctggctgatg gttatctatg aagggaaaca gctctaacaa ctgagtgtcc cccacataga 1200 
cacagctcat gagttcaagg ggcaaaggaa ttgaacagca gcctcctaat agccggcctt 1260 
ctttgtgatg tggaaataat tatcagcatg taaaagtcta tatatatatt caacaattct 1320 
gaccccctgc aaaattcaaa tctacaactg atttgcttcc tggggctcct gaaaacaact 1380 
ttgtcaaaat tcgttcagaa atataatcag ccaatcgttg ccccttggga ccgcaggata 1440 
aagcaagtca gccatgacca atgaggagtc ggccgtgcac aattacatgc agacctgcag 1500 
gacatacgag tccctgctat ggtccctccc cagtcaggcc cccactgcct gggctgcagc 1560 
tagaagcatt caggcacaag tgcattcaac aaatacttat ttaattgcat tggtggctag 1620 
agggctgttg ttgattaagg tacattaatg gatccatttc ctccctgtat ccaagactct 1680 



ctggaatcca caaaatgacc 
agctgtgaag gtgtaccgcc 
ggccagcctc aagttggggt 
caacattact ggaatccaac 
agcatatgca gccatcatgg 
tgaccagatc ttgttaagac 
cacctttgct aaggcaagct 



tgcaggccca ggcacgatgt 600 
tcatcactcg taattcctac 660 
tggataaggc tgtgcttcaa 720 
agttctctaa gaaggagatt 780 
aggaagatga tgaaggctcc 840 
gaactacaac catcaccatt 900 
ttgttgcttc tgaaaacagg 960 

971 
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gccatttgtc tctgcagttc ctccccactg aagaatcggg agtatatttc tccagcccct 1740 
aatgttgcat ttagtcacat gtctagcttt ggccactgga atattaatct acatgaccaa 1800 
aaacttggaa agtgtgcatt catttgtgct cactcactcc tgctatcacc aaaagaacaa 1860 
gcccaggcca gactgctgct tccagcagaa gataagagac accaagagca aagtcaagct 1920 
tcccagacac gctcatgcca gattaaccaa tcctcagcta gacccatagc tccatgaaaa 1980 
taaatgactg ttgtattaag cc 2002 

<210> 131 

<211> 8277 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: IiI:331610.2:2000MAY01 
<220> 

<221> unsure 

<222> 4213, 4217, 4222 

<223> a, t, c, g, or other 

<400> 131 

agcaatgcaa gattagatct ctaaatgcag caaaacactg cctgaaaaca gaccggcccg 60 
cgcagcaagc agacatttca cggtgcgctg gggaagctte aaaatata.tc tg.tgac.tctg 120 
tcttcgttgc tcttcatccc catcaatttc atcacgggag gcgagcagca agtaagaatt 180 
tcactttcgg atctgcctag agacacacct ccctgctccc tcccccactc gatgtgaaga 240 
gtattccgga gtctccgggc gggagtagat ttgcagcacc cctaagcggg agcgagggaa 300 
aacctactga ttctttagct cattatcatc tctcccagac gagatttcct tcttatcgcc 360 
tgcctcatcg ctcaagtttg agcctcccga agtccgggcg ggagagacga aacccctggc 420 
tcacccccag ccgcaggaag ccaccgcctt gctccaagcc cctgcagctc tgctgcaccg 480 
cagcttctca cccagtgcgg atgctgtaga tcaacaggtt cagggaactt gagcagaata 540 
aggagagacc accgggtgcc gcagctcggg tgcagaggga aaaaaggacc catagacttg 600 
tggctcgcgt cgcgcgcgca cgctgcgcca ggggcccagg ctggggcgca ctccctctct 660 
gggtcctcca gtccgattgc cctgccccca ccttacaggt ctgggatgta cctttccatc 720 
tgttgctgct ttcttctatg ggcccctgcc ctcactctca agaacctcaa ctactccgtg 780 
ccggaggagc aaggggccgg cacggtgatc gggaacatcg gcagggatgc tcgactgcag 840 
cctgggcttc cgcctgcaga gcgcggcggc ggagggcgca gcaagtcggg tagctaccgg 900 
gtgctggaga actccgcacc gcacctgctg gacgtggacg cagacagcgg gctcctttac 960 
accaagcagc gcatcgaccg cgagtccctg tgccgccaca atgccaagtg ccagctgtcc 1020 
ctcgaggtgt tcgccaacga caaggagatc tgcatgatca aggtagagat ccaggacatc 1080 
aacgacaacg cgccctcctt ctcctcggac cagatcgaaa tggacatctc ggagaacgct 1140 
gctccgggca cccgcttccc cctcaccagc gcacatgacc ccgacgccgg cgagaatggg 1200 
ctccgcacct acctgctcac gcgcgacgat cacggcctct ttggactgga cgttaagtca 1260 
cgcggcgacg gcaccagagt tcccagaact ggtcatccag aaggctctgg accgcgagca 1320 
acagaatcac catacgctcg tgctgactgc cctggacggt ggcgagcctc cacgttccgc 1380 
caccgtacag atcaacgtga aggtgattga ctccaacgac aacagcccgg tcttcgaggc 1440 
gccatcctac ttggtggaac tgcccgagaa cactccgctg ggtacagtgg tcatcgatct 1500 
gaacgccacc gacgccgatg aaggtcccaa tggtgaagtg ctctactctt tcagcagcta 1560 
cgtgcctgac cgcgtgcggg agctcttctc catcgacccc aagaccggcc taatccgtgt 1620 
gaagggcaat ctggactatg aggaaaacgg gatgctggag attgacgtgc aggcccgaga 1680 
cttggggcct aaccttatcc cagcccactg caaagtcacg gtcaagctca tcgaccgcaa 1740 
cgacaatgcg ccgtccatcg gtttcgtctc cgtgcgccag ggggcgctga gcgaggccgc 1800 
ccctcccggc accgtcatcg ccctggtgcg ggtcactgac cgggactctg gcaagaacgg 1860 
acagctgcag tgtcgggtcc taggcggagg agggacgggc ggcggcgggg gcctgggcgg 1920 
gcccgggggt tccgtcccct tcaagcttga ggagaactac gacaacttct acacggtggt 1980 
gactgaccgc ccgatggacc gcgagacaca agacgagtac aacgtgacca tcgtggcgcg 2040 
ggacgggggc tctcctcccc tcaactccac caagtcgttc gcgatcaaga ttctagacga 2100 
gaacgacaac ccgcctcggt tcaccaaagg gctctacgtg cttcaggtgc acgagaacaa 2160 
catcccggga gagtacctgg gctctgtgct cgcccaggat cccgacctgg gccaagaacg 2220 
gcaccgtatc ctactctatc ctgccctcgc acatcggcga cgtgtctatc tacacctatg 2280 
tgtctgtgaa tcccacgaac ggggccatct acgccctgcg ctcctttaac ttcgagcaga 2340 
ccaaggcttt tgagttcaag gtgcttgcta aggactcggg ggcgcccgcg cacttggaga 2400 
gcaacgccac ggtgagggtg acagtgctag acgtgaatga caacgcgcca gtgatcgtgc 2460 
tccccacgct gcagaacgac accgcggagc tgcaggtgcc gcgcaacgct ggcctgggct 2520 
atctggtgag ccactgtgcg cgccctagac agcgacttcg gcgtagagcg ggcgtctcac 2580 
ctacgaagat cgtggacggc aacgacgacc acctgtttga gatcgacccg tccagcggcg 2640 
agatccgcac gctgcaccct ttctgggagg acgtgacgcc cgtggtggag ctggtggtga 2700 
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aggtgaccga ccacggcaag cctaccctgt 
tgagcggatc ccttcccgag ggggtaccac 
tgtcgctgcc gctcatcgtg actctgagca 
tcaccatcgc cgtcaagtgc aagcgcgaga 
tcgccgagta cagccacccg cagctgggtg 
aaaatgatat catgctggtg cagagcgaag 
acgtggtgag cagcccctcc ctggccacct 
tgcccctcag ctcgccccgg tcggaggtga 
ctgtccctca ggggcacgcg ggctgccaca 
gcgagacccc tgccactcgg atgtccataa 
attacatggg cagcaggcag cagtttgttc 
cccagaaaga gccagcctga gagacagtgg 
ccaagacact aacaaaggct cctgctgtga 
aactacttca actaaaagcc aaccacttga 
agatgaatgc cgagtgcttg gtcattctga 
caatcaggct gaaaatgcag attaccgcac 
tgttgagact gagacttacg aaactgtgaa 
tggaaaagac aagcgagagc acactattct 
caaacgtgcc ctgagccctc tcctccaaga 
ggcgtgcatc gagccttgca cctcaacaaa 
aacccaggag cccctggctg aagcaagcag 
tgtcacctag taagcaacca agagaccctc 
tctttgcaga tgtgcattcc agagccagcc 
agcagcttga ccaccccaac agggatctgg 
tgagagaaat tgataagctt ttgcaagact 
agtgaaaaaa ganaaanaaa anggcattgg 
atgatccctc ctggtgataa cacattttac 
ggcttttagt gaacatctga agtgcccaca 
tcagagataa caatggtttc gttttgacca 
taaagagaaa agaaaaaaag agagaagaaa 
agaattacct tttgacaatc tgttaggaag 
atcactctca gactgtcctc cgtgatttat 
atacaaagcc aggggcataa tttgtgatct 
tctactgaaa gtcctgaaaa gaccttgcag 
atttgtgggt aaacatttct gtataaagtt 
tccatatcca tagtccgaaa accaaaacca 
atctccatgt catattaaaa gttaaatgta 
acgtagacta attcacgtca ttaaagaaga 
agcattttag tgtccaacaa agatttaaac 
aaaagttcta aaaaatagtt accattgagt 
atatgtttta ttcatttgtg gactactaat 
acatacgcta gtcacatgac catttaaatg 
atctattgac atattttatt gatttattgc 
gtgtgtaaga gctatattgt gtattttatt 
tttgggctta tattgtaaat ggacaagtgt 
agatttatgt tttttcctgc tttccttttc 
gagcaaagtc tcttgtttaa tgtatagtct 
aatgtgtttg agtgagctga tggagctaac 
tggttatgtg caagtccttg tagaagcttt 
acatttgtac caatatctga aacttcttca 
gtgtagatat tatgccatca gttggttctc 
tcccattata ttttgaaaac atgaaaaatg 
acttgcattg tgtagtgttt ttcaagaggt 
tcagtgctct tctgcctctt tttgttgggg 
tcatttggtt acacctttcc tttgtgacat 
ggctaagaga cctcagattt caggatttgg 
cagcagatgg gaaaatgcct tgattgacat 
ttacagcttc atgacaaaca gttttgtgcc 
ataaaggctg tttgagcact ggagcagaaa 
ttgtgccttc tcttctggcc accaagccag 
cttatattta aaacaaaaac aaaagcaaaa 
ttttaaactt taaaaacttc tactgaaaat 
ctgtacctcg tttgcgaaac tgctttgaga 
ttctcttgag aactatattt tggttcctag 
ttttattact cataaatgtt taaattagaa 
ttgactctct atattttgga caatttatgt 
tgttattttt gccaggagac tacaggttga 



ccgcagtggc caagctcatc atccgctcgg 2760 
gggtgaatgg cgagcagcac cactgggaca 2820 
ctatctccat catcctccta gcggccatga 2880 
acaaggagat ccgcacttac aactgccgca 2940 
ggggcaaggg caagaagaag aagatcaaca 3000 
tggaggagag gaacgccatg aacgtcatga 3060 
cccccatgta cttcgactac cagacccgcc 3120 
tgtatctcaa accggcctcc aacaacctga 3180 
ccagcttcac cggacaaggg actaatgcaa 3240 
ttcagacaga caattttccc gcagagccca 3300 
aaagtatttc agtagctcca cgtttaagga 3360 
gcacggggac agtgatcagg ctgacagtga 3420 
catgtctgtt agggaggcac tcaagatgaa 3480 
acaagaacca gaagagtgtg ttaattgcac 3540 
caggtgctgg atgccacagt tccctgcagc 3600 
aaatctcttt gtacctacag ttgaagctaa 3660 
tcccactggg aaaaagactt tttgtacatt 3720 
cattgccaac gttaaacctt atttaaaagc 3780 
ggtcccctca gcatcaagca gcccaaccaa 3840 
aggcctcccc tggatggcct gtgaaggcaa 3900 
tcagtacttg ccccactgac agtcaatatc 3960 
ccttcatggc ttccgatcag atggcaaggg 4020 
gggattccag tgagatgggt gctgttcttg 4080 
gcagagagtc tgtggatgca gaggaagttg 4140 
gccggggaaa cgaccctgtg gctgtgagaa 4200 
cattttcttg tctcttctgt tgatttaaaa 4260 
agggatgaag aaagaccaat gctgctttaa 4320 
agtatgttct ttccactgct gatttctttt 4380 
aacttgtatt aggacagaat taatgatgct 4440 
aaggagagat gaaaaaggag gatgaggaga 4500 
gtatgcagtg tgagaactga agtatttctg 4560 
gctgacttaa ctgtttacct ataaaccccc 4620 
gtggtggatt tctagcagtc atcacaggct 4680 
tagtccaagc tacaccaaac attaacacat 4740 
acctgacaca catataaaca caaggaacat 4800 
aaaaaaaacc tttggtccat ttgtaagacc 4860 
aaaagataca gtccattttg tcctgcacac 4920 
agaaaactta aagatttaaa atgcctattt 498.0 
aatgatgaat atgttttaaa tttgacatag 5040 
ggtaagattc agagaaaatt aacttgatta 5100 
tatagctcta ggaaagtgaa aatgtcttag 5160 
tgctaatgta agaagattca atgtgtttac 5220 
agattcagtg catatgagcc aaattgttga 5280 
aaattaatat atagttgtgt tgcaaaaata 5340 
tgccttggta gcctgtccga acctcctatg 5400 
cccatgggag tgtggggaag cagtgcctca 5460 
accaagtact acagtacata atctgttcaa 5520 
tgaaaggtca aaaattacat ccatcagtca 5580 
tattaaagtc atgctaaatc acaagaattg 5640 
tgttttttca ataacataca gcttctgcct 5700 
aaaagtattt taagtgcttc agatgtgtgt 5760 
ctttaatgca tgtatgtacc agcagtggtt 5820 
ctgggtctta acaaaatgtt ttcctttatc 5880 
tcctttgaga acaatacacc ttctattcct 5940 
ttagcgagtt tcaaacttac ttccatatga 6000 
gaaaaataaa attagcactt gcagaagtag 6060 
tttctttcag catttaaaat ttttggcatt 6120 
cataccttag aaaatgtggt gctgagttaa 6180 
aatgcattat ttgcaaactg gtggataatt 6240 
tgtagaaaca gcaaaaatgt cataaaaatt 6300 
acaaacattg aattacatta agttttgtca 63 60 
atttccgcca aatgccatca atattttaga 6420 
gggaagagtg gacaactccc atcagcctta 6480 
taacagcctt tccaaagctc tactcttggt 6540 
aagaagggac cttgtacatg tgaaacctaa 6600 
atctgaaatg tgttgtgctc tgttatatga 6660 
tttagcttga tagctgaaat ttgatggaaa 6720 
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actgatttcc atttagtctt accaagtgtt gcttctctct tactagacag atatccactt 6780 
agtaaaatct aaagcagtat gtaaatgaaa ccagcaaaga gagtagggtt tattttaata 6840 
aacattctta attgctaagt aaaccagttg ttcaatttat tatatgtgtc tgaggacatt 6900 
aaaacaccat taagattgta ataattggtt gtgcccaatg tgtgagggat ttacctttta 6960 
ggctctctgt caccagtgat ttactagtgt tagctgttta acacattatc tgtatttagt 7020 
agtgattatt tatttacaag ttggtggtaa ttcagcaggt caggagtcta agcttttata 7080 
gttgaattga ggaaatctcg cttttattca tttagctggc aactgccttt attgcagacc 7140 
tctggtgctt ggctttcaag gaagcctatg agatgccaaa atcacacctt tagagagcac 7200 
cttgctctaa tagpgtgatg catgagcaaa cagtgagatt tgaaggggtt ttaacataac 7260 
tttagaatgt gaaaacaata tcaattcata tctttcaagt actaacccct caaacaagcc 7320 
cacacataca aaactatgtg atgtgatacc actttgtctt ttaggtcttt aagtaactga 7380 
agttaagcac agaaaaaaaa atcacttcat ggaaatttca gtaagaaacc caaactttct 7440 
agaaaattgg cttggcagat gagctaagca aaaaaaaaaa aaaaaagcaa caaaataacc 7500 
ttttcatcag agttaaaagt agtgagaatc tggttagtta tacgtaatac caaaggtaaa 7560 
cattaaagaa gacattatca ggcaatttca aagaattctt tcatgctatt tcttaaccct 7620 
gacatttcta actttattgc aggcaatata caaagattgg ctcactactc cataggttaa 7680 
ttgaattcct ggttgagaaa ctaacttgtt ttgttttcca aaattagctg aaatcttgta 7740 
aaacatgact tccctttaaa ggatctagat attgttcaat ttaaaatatg gcaccataaa 7800 
aaagtcatgt agtaatagag catatgcttt tttagaacca ggttaaaagc tgtttgttat 7860 
ctaatagagt aaaagttact gaggtcaaca gatagacagg tctggcataa tatatgcccc 7920 
agtcataata gtctaattat attcttttga caacagacag cattcatcag agcactaatt 7980 
cgaatgagag ttttggctat caagatgtgt tgattctaaa tattattgaa tgagacctga 8040 
ttaaagtctg acttttcctg gccccagtgt tttgcaggtt ggctatttcc attatcagcc 8100 
catcactcca taaagttctt agctgccacc aagtgattgg tgaacaagga caacaacaaa 8160 
agcagcatac attgtatgga tttatctcaa cggctattgt ttaatggctg tgtttaaatg 8220 
tgatctatct ggaaaatgag gactccggaa taaaaagcta ttactaataa aaaaaaa 8277 

<210> 132 

<211> 1417 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID -No: LG: 982697 . 1:2000FEB18 
<220> 

<221> unsure 

<222> 788, 802, 1173, 1187, 1207 
<223> a, t, c, g, or other 

<400> 132 

gtggtgtggc tgcttctagc cacttatatt cagatgtggg agcaaagaca aaaggtagaa 60 
cttatattta aaagggaagc agagcaaaaa aagtttggaa aatttgcagc ctggccatgt 120 
ggcagagaaa gaaaaaagct ttttcagtag aggaattcaa gcagactgtg gaacaattac 180 
ttgctagaga tgtttgcagg attaaagggg agccaagtgc tactatccaa gacaataggg 240 
aaaaagcctt gaaagcattt caaagaactt cagagtggct tctcccatca caggcccaga 300 
agcttaggag gagagaatag attcatggac tgggcccagg gccatggtac cttgcacagc 360 
cttgggacac tgctcaccac atcctagctg ctcctgttcc cactcttgcc atggctcaaa 420 
ggggcctaca tacagcttgg actgccactt tggagaatgc aaacagtaaa ccttggtagc 480 
tttcatgtgg tgttaagcct gcaggcacac aaagtgcaag aatcgtggat tcttggcagc 540 
ctctgcctag atttcagagg acatatggaa aagtctgttt gtccaggcag aagcctgtta 600 
caggggtgga gcccttacaa agaacctcta ctagggcagt atagagggga aatgtggggt 660 
tggaggctcc aacagggtca ccgctggggc actgcctaat ggaactatac gaagtgggcc 720 
actgccctcc agacctaagg atgatagatc taccagcagc ttgtattctg cacctggaaa 780 
agccacangc actcaactgc ancccatgag cgcagccttg gattctttac cctgcaaagc 840 
cataggggca ggactgctca aggcctgggg agcccacccc ttgtaccagt gtggcctgga 900 
tgtggaacat gatgtcgaag attattttgg aactttaaga tttagtgact tccctactgg 960 
gttttggagc tgtgtggatc ctgtagaccc tttcttttgg ccaatttccc cctttttggg 1020 
atggaaacat ttacccaatc cctatacccc cattgtatct tggaagtaaa taatttgttt 1080 
ttgattttac aggctcataa gtggaaggga cttgccttgt ctcatatgag actttggact 1140 
tggacttttg aattaatgct ggaatgagtt aanactttga gggactnttg ggaagatata 1200 
actgtanttt gcagtatgag aaggacatga gatttgggag acaccagagg tggaataata 1260 
tgatttggat ctgcatcccc accaaaatct catgttcaat tgtaatccta aattttggag 1320 
gttgagcctg gtggaagagg ttggataatg ggggtggttt ctcatggttt aacaccatcc 1380 
ccctgggtgc tgttctcatg acagtgagtg agttatt 1417 



64/228 



WO 01/62927 



PCT/US01/06059 



<210> 133 
<211> 1143 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1080896. 1:2000FEB18 
<400> 133 

gccgacagca taatgtgagg 
caggctcctt tgaagatctt 
gaagtgtgga tcaaggactg 
cgtcctgatc actcccactg 
ccgtgtaaaa cctgcagcgc 
cagagtgacc ctgaagaaga 
ccacgcacag ccgaagcatg 
cttacagtaa gggcttcaac 
caagtcactg aggtaggaca 
tgtaccagga ctctaaaaga 
ccaggaaata accaacttga 
ttttaaaata caattaagga 
aacagaagaa aaaggggtcc 
tgatagtaag cagcatggaa 
cagaaaataa gtacatctgt 
gtgtcatttg age tact taa 
aaagtcagee cctcctgcac 
ccctcagacc caagtaaaat 
gatcttagtg taagcatcct 
aaa 

<210> 134 
<211> 899 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: LI:811341 ,1:2000FEB01 

<220> 

<221> unsure ^ 
<222> 500 

<223> a, t, c, g, or other 
<400> 134 

acctgacctg acaaaactct ttacactcta tgtgtcagaa agagaaaaaa tggcagttgg 60 
agttttaacc catactgtga ggccctggcc aaggccagtg gcctatctct caaaacaact 120 
agacaggatt tccaaaggct ggcccccagg tetaagggee ctagcagcaa cagccctgtt 180 
agcacaagaa gcagataaac taacccttag gaaaacctga atataaaggc cccccatgct 240 
gtggtaactt taatgactac caaaggacct cattggttac aaatgetaga ttaaccaagt 300 
accaaagctt gctatgtgaa aatccccaca taaccactga agtttgcaac accctaaccc 360 
caccaccttg ctcctggtat cagagagect agttgaacat actgtgtaga ggtgttggac 420 
tcagtttatt ctagcaggcc caacctccga gactatcctt aaacatcagt agactgtgag 480 
gggtacgtgg aegggagtan ttcgccaacc ectgeaaagt gactctgaag aagacgacaa 540 
gctctgctcc agtcacaccc agaagctgac tggtccacgc aeggecgaag catgagaaaa 600 
etcattgegg aactcatttt ccttaaaatt tggactttta cagtaaggac ttcaactgac 660 
cttcctcaga ctgaggactg ttccagtgta tacatcaagt cactgaggta ggacaaaagg 720 
ttgctacagt cctattattt tatggttatt ataagtgtac tggaactcta aaaataactt 780 
gtttgtataa tgttattctg tacaaggtat gtageccagg aagtgaccaa cctgatgtgt 840 
gttatgaccc atetgagect cccatgacca cggtttttaa aataagatta aggactgag 899 

<210> 135 
<211> 923 
<212> DNA 
<213> Homo sapiens 



caagtctagc tgttcccccc 
caggtggact tcacagaaat 
gaaegtagee tcttgtgccc 
ctgtgaacgt agagagaatc 
ctgaatcctg ggaggcaaga 
tgacaagccc tgctccagtc 
aggaagctca ttgtgggctt 
tgaccttcct cagactgaga 
aaagattget acattcctat 
aacttgtttg tataatgeta 
tgcgtgttat gacccatttt 
ctggtccttt tctaggtgac 
ccaaaaatgt aaccttaaaa 
taggatgegg ttctctaaat 
caaaaatcat atttatgega 
aaagaaaata aaaaaagatc 
aagtgggagc tgcatccttt 
aaaaaaaaat gtaacattag 
aataaaagga gaggttcaaa 



eggcatacaa gcttatagag 60 
tgccaagttg tggaggtgtc 120 
atggtgaagg acccagatgt 180 
ctagcctgga tccatcacaa 240 
ccaagtctgg acaacccctg 300 
acacccagaa gctgactggt 360 
catttttctt aaattttgga 420 
actgtttcca gtatatacat 480 
tattttatgg ttattataag 540 
ttctatccaa ggtatgtagc 600 
aagcctccca tgatcacagt 660 
acaagaaagg aaatagctag 720 
tttgacgett gtgecactat 780 
tggaaaaaaa agttacacag 840 
gatgtgtcaa tactggtctt 900 
ctgtttggct ccaaaaatga 960 
taaaattgat aatcacaaac 1020 
gcattgatgg aaaagggctg 1080 
aacgctctcc agaaccagta 1140 

1143 



65/228 



WO 01/62927 



PCTYUS01/06059 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 903225 . 1 :2000FEB01 



<220> 

<221> unsure 
<222> 239 

<223> a, t, c, g, or other 



<400> 135 

ggaggaaggt aatgaaggag attatcttat 
gggacagaag aatctcttga gtcctctaaa 
acaattcagt agtgtctgga atagaacagt 
cgggatgctg attcagtagg ccttgagtgg 
atgccctggc ttggggatca cattttgtga 
cccttccttc ggtgtttgag accacttcat 
aagtttcaaa cc cagtagag ttacctcaaa 
caccgaaagt gaagaaggaa gctcctggcc 
taaaggccaa gaaggtagtg ttgaaaggtg 
tgtcacccac cttccagcgg cccaagacac 
ggaagaccac ccccaggaga aacaagcttg 
ccactgagtt tgctatgaag aagataaaag 
ttaaagccaa caagcaccag atcaaacagg 
ccaaggtcaa caccctgatg gagagatgaa 
tgctttggat gttgccaaca aaattgggat 
aaatatacat atctcttttc acc 



cctacttatt attttacttc cttattctgt 60 
aaactatatg ctgatagcag gagttacaga 120 
ggtttgaaaa gtagagggct tgttaaagca 180 
ggataattta catatccaac aggccaggng 240 
ctgatgcaga gtactgatgg tcatccagtc 300 
ctggaccgag ctaaagtcta ggaagaaata 360 
gatacacttg agaccctttt cagaagatgg .420 
cgcctaaagc tgaagccaaa gcaaaggctt 480 
tccacggcca caaaaaaaag aagatccgca 540 
tgagactctg gaggccgccc agatatcctc 600 
accactatgc tatcatcaag tttcctctga 660 
acaacaacac ccttgtgttc actgtggatg 720 
ctgtgaagaa gctctgtgac attgatgggg 780 
ggcatatgtt ccactggctc ctgattatga 840. 
catccaaact gagtccatct ggctaattct 900 

923 



<210> 136 
<211> 343 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 242079 . 2 :2000FEB01 



<400> 136 

gctcagggtc ctgctgccct gcacaacctc aggaaactgc tccctgaatc ccagctacta 60 ■ 
cagctccagc tgtggctcaa aggaccccag gtatagctca ggctgcctgt ccagaggatg 120 
caagccataa gccttagtgg ctcccgtgtg gtgttaaatt aagcctgtgg gcacatagag 180 
tgcaagaatt gaggcttggg agcctccaac tagatttcag agtatgtgtg ggaaagcctg 240 
gatgtccagg cagaagccag ctgcagggac agagccctca tggagaacct ctactagggt 300 
agtgtggagg ggaaatttgg ggttggagtt cccacacagc ttc 343 

<210> 137 

<211> 1227 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 979580 . 1 :2000MAY19 
<400> 137 

cttggatatc tgctcatttg gcccgatgag aagagaaaaa cagaaaacaa aagctctgag 60 
tgggtagagc gtggggggag cttgaagctg ttattaaagg aaggtatgat taactcctaa 120 
ctcacagccg cttcagctca tctccatggc tcagccacca ggccagcctt tgctgtcacc 180 
taacagataa ttattgtctt ttctacaata tgattaagtg gaaggagaat cagaacactg 240 
tagaactgga tctgatattt cttgttagca tatatatata tatttaatct atgtgcttgt 300 
ccttgaaaac agcacttcct gtacccagca atcacagagc tgctcttgga atttaacacc 3 60 
ctggtggttc tcaaatgcca tctgaaaaac attattagtg ttacccatca tgacctctgg 420 
gtagtggaag gtgaatggga gaaatctctc cccatttgaa gaaataggga accagtctca 480 
ctttgttgcc caggctggag tgcagtggca caatcttgct cactgcaacc accacctccc 540 
gggttcaagc gatcctccca cctcaacctc ccaggtagct gggtctgcag gcgtgcgcca 600 
ccacacccgg ctaatttttg tgtttttagt acagaaagag tttcaccatg ttgaccaggc 660 
tggtctcaaa ctcctaacct caagtgattg gcccacctgg gcctcccaaa gtgctgggat 720 
tacaggtgtg agccactgca gcccagccta cgaagttgtt tttgcggtta agcagcaatt 780 
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tggaaacgaa gcatttctga ggagctcagt ttaagaaaca ctgtgatgtt gtgttttaaa 

cctatgaact cccctccgta gctcatccct ctacttttcc atctaacccc agtactggca 

ccttctctat gattagtaaa tttcttcagc tagaaatgtt acagtctgtt ttttaacaat 

gaagtcagga cccagtggat tatagcattg ttacacttca tgctcatgtt atataaaatt 

gcataaaatg tattaaaatt tatgtctgct cttatctctg ccttgcaaag aatgaatttg 
tatagctttt aactaagcat gtaacggggg cagcaccaca tctggaattc aagctgactt 

tctcacgtac cgtaatcttt gtgcttgggt ctaataagta caacccccat ggtgtggttt 
atcatttttt tattgtatgc acaatct 

<210> 138 
<211> 1259 
<212> DNA 
<213> Homo sapiens 

<220> * 
<221> misc_feature 

<223> Incyte ID No: LI : 1169865 . 1 :2000MAY01 
<400> 138 

ccggttgtag atgtatctct ccaggaagcg ctgtgtccca ttaaacgcgt agattcctgc 60 

cgtccctgga aaatgtgaat cccagccatg ctgattcctg tccagccatt tccagtgcta 120 

gaggcccaca gtttcagtct catgtgcctc cagtcggcct cagttcctca tcactgttcc 180 

tgtgctcaca gtcatcaatt atagacccca caacatgcgc cctgaagaca gaatgttcca 240 c 
tatcagagct g.tgatcttga gagccctctc .ct.tggctttc ctgctgagtc tccgaggagc 300 

tggggccatc aaggcggacc atgtgtcaac ttatgccgcg tttgtacaga cgcatagacc 360 

aacaggggag tttatgtttg aatttgatga agatgagatg ttctatgtgg atctggacaa 420 

gaaggagacc gtctggcatc tggaggagtt tggccaagcc ttttcctttg aggctcaggg 480 

cgggctggct aacattgcta tattgaacaa caacttgaat accttgatcc agcgttccaa 540 

ccacactcag gccaccaacg atccccctga ggtgaccgtg tttcccaagg agcctgtgga 600 

gctgggccag cccaacaccc tcatctgcca cattgacaag ttcttcccac cagtgctcaa 660 

cgtcacgtgg ctgtgcaacg gggagctggt cactgagggt gtcgctgaga gcctcttcct 720 

gcccagaaca gattacagct tccacaagtt ccattacctg acctttgtgc cctcagcaga 780 

ggacttctat gactgcaggg tggagcactg gggcttggac cagccgctcc tcaagcactg 840 
ggaggcccaa gagccaatcc agatgcctga gacaacggag actgtgctct gtgccctggg' 900 

cctggtgctg ggcctagtcg gcatcatcgt gggctccgtc ctcatcataa agtctctgcg 960 

ttctggccat gacccccggg cccaggggac cctgtgaaat actgtaaagg tgacaaaata 1020 

tctgaacaga agaggactta ggagagatct gaactccagc tgccctacaa actccatctc 1080 

agcttttctt ctcacttcat gtgaaaacta ctccagtggc tgactgaatt gctgaccctt 1140 

caagctctgt ccttatccat tacctcaaag cagtcattcc ttagtaaagt ttccaacaaa 1200 

tagaaattaa tgacactttg gtagcactaa tatggagatt atcctttcat tgagccttt 1259 

<210> 139 

<211> 2604 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:337818 . 2 :2000FEB18 
<400> 139 

cgcggagcct gggagaggag aaggagccga cctgccgaga tggaggcgac cggcacctgg 60 

gcgctgctgc tggcgctggc gctgctcctg ctgctgacgc tggcgctgtc cgggaccagg 120 

gcccgaggcc acctgccccc cgggcccacg ccgctaccac tgctgggaaa cctcctgcag 180 

ctacggcccg gggcgctgta ttcagggctc atgcggctga gtaagaagta cggaccggtg 240 

ttcaccatct acctgggacc ctggcggcct gtggtggtcc tggttgggca ggaggctgtg 300 

cgggaggccc tgggaggtca ggctgaggag ttcagcggcc ggggaaccgt agcgatgctg 360 

gaagggactt ttgatggcca tggggttttc ttctccaacg gggagcggtg gaggcagctg 420 

aggaagttta ccatgcttgc tctgcgggac ctgggcatgg ggaagcgaga aggcgaggag 480 

ctgatccagg cggaggcccg gtgtctggtg gagacattcc aggggacaga aggacgccca 540 

ttcgatccct ccctgctgct ggcccaggcc acctccaacg tagtctgctc cctcctcttt 600 

ggcctccgct tctcctatga ggataaggag ttccaggccg tggtccgggc agctggtggt 660 

accctgctgg gagtcagctc ccaggggggt cagacctacg agatgttctc ctggttcctg 720 

cggcccctgc caggccccca caagcagctc ctccaccacg tcagcacctt ggctgccttc 780 

acagtccggc aggtgcagca gcaccagggg aacctggatg cttcgggccc cgcacgtgac 840 

cttgtcgatg ccttcctgct gaagatggca caggaggaac aaaacccagg cacagaattc 900 

accaacaaga acatgctgat gacagtcatt tatttgctgt ttgctgggac gatgacggtc 960 
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agcaccacgg tcggctatac cctcctgctc ctgatgaaat accctcatgt ccaaaagtgg 1020 
gtacgtgagg agctgaatcg ggagctgggg gctggccagg caccaagcct aggggaccgt 1080 
acccgcctcc cttacaccga cgcggttctg catgaggcgc agcggctgct ggcgctggtg 1140 
cccatgggaa taccccgcac cctcatgcgg accacccgct tccgagggta caccctgccc 1200 
cagggcacgg aggtcttccc cctccttggc tccatcctgc atgaccccaa catcttcaag 1260 
cacccagaag agttcaaccc agaccgtttc ctggatgcag atggacggtt caggaagcat 1320 
gaggcgttcc tgcccttctc cttagggaag cgtgtctgcc ttggagaggg cctggcaaaa 1380 
gcggagtgct tcctcttctt caccaccatc ctacaagcct tctccctgga gagcccgtgc 1440 
ccgccggaca ccctgagcct caagcccacc gtcagtggcc ttttcaacat tcccccagcc 1500 
ttccagctgc aagtccgtcc cactgacctt cactccacca cgcagaccag atgaaggaag 1560 
gcaacttgga agtggtgggt gcccaggacg gtgcctccag cctcaacagt gggcatggac 1620 
agggttaatg tctccagagt gtacactgca ggcagccaca tttacacgcc tgcagttgtt 1680 
ttccggagtc tgtcccacgg cccacacgct cacttgactc atgctgctaa gatgcacaac 1740 
cgcacaccca tacacaacta caagggccac aaagcaactg ctgggttagc tttccacaga 1800 
cataaatata gtccatctgc aatcacaagc acatagccag gtaacccacc aactcccctg 1860 
gatctgcagc ccacacgtgg gagtctggct gtcaccttca caagccacag aaacggccac 1920 
acatgttcac agctcacacg ccctctccat tcatcgaact tctcagtgtc cctgtccctg 1980 
gtgcctggca cagggaacag catgccccct ccggggtcat gccacccaga gactgtcgct 2040 
gtctatggcc ccaactcatg ctccctctct tggctacacc actctcccag cctgtgacca 2100 
ccgatgtcca cacaccccca accacttgtc cacacagcta cccacgtacg acatcgtcct 2160 
ggctccccag agtatcttcc cactgagaca cgccgccccc acagaggcac agtccccagc 2220 
cacctctgca actgcagccc tcagtcaccc ctttttaagc accctgattc taccaaatgc 2280 
aaacacatct gggtctgcga ttatgcacag agactttgga catacgagga ccctcagacc 2340 
ggaggaacac ctgcccaacc ccaacacgtg cttatgtaac cacgtggaaa gcggcccctg *2400 
ctgcccctcc acacacacat acacactcac tgatctacag cccctgttcg gcgtcagagt 2460 
ccccactaga cccagtggaa ggggttagag accaagtagg ggccagtttc caattcaccc 2520 
tgtcagggag tgagccggat ctgacgttcc ttgtgactta agggtccggc ttgggaatta 2580 
aagtttgttt ctggccttta gcct 2604 

<210> 140 

<211> 2114 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 337818 . 1 :2000FEB01 
<400> 140 

cgcggagcct gggagaggag aaggagccga cctgccgaga tggaggcgac cggcacctgg 60 
gcgctgctgc tggcgctggc gctgctcctg ctgctgacgc tggcgctgtc cgggaccagg 120 
gcccgaggcc acctgccccc cgggcccacg ccgctaccac tgctgggaaa cctcctgcag 180 
ctacggcccg gggcgctgta ttcagggctc atgcggctga gtaagaagta cggaccggtg 240 
ttcaccatct acctgggacc ctggcggcct gtggtggtcc tggttgggca ggaggctgtg 300 
cgggaggccc tgggaggtca ggctgaggag ttcagcggcc ggggaaccgt agcgatgctg 360 
gaagggactt ttgatggcca tggaggaaca aaacccaggc acagaattca ccaacaagaa 420 
catgctgatg acagtcattt atttgctgtt tgctgggacg atgacggtca gcaccacggt 480 
cggctatacc ctcctgctcc tgatgaaata ccctcatgtc caaaagtggg tacgtgagga 540 
gctgaatcgg gagctggggg ctggccaggc accaagccta ggggaccgta cccgcctccc 600 
ttacaccgac gcggttctgc atgaggcgca cgggctgctg gcgctggtgc ccatgggaat 660 
accccgcacc ctcatggcgg accacccgct tccgagggta caccctgccc cagggcacgg 720 
aggtcttccc cctccttggc tccatcctgc atgaccccaa catcttcaag cacccagaag 780 
agttcaaccc agaccgtttc ctggatgcag atggacggtt caggaagcat gaggcgttcc 840 
tgcccttctc cttagggaag cgtgtctgcc ttggagaggg cctggcaaaa gcggagtgct 900 
tcctcttctt caccaccatc ctacaagcct tctccctgga gagcccgtgc ccgccggaca 960 
ccctgagcct caagcccacc gtcagtggcc ttttcaacat tcccccagcc ttccagctgc 1020 
aagtccgtcc cactgacctt cactccacca cgcagaccag atgaaggaag gcaacttgga 1080 
agtggtgggt gcccgggacg gtgcctccag cctcaacagt gggcatggac agggttaatg 1140 
tctccagagt gtacactgca ggcagccaca tttacacgcc tgcagttgtt ttccggagtc 1200 
tgtcccacgg cccacacgct cacttgactc atgctgctaa gatgcacaac cgcacaccca 1260 
tacacaacta caagggccac aaagcaactg ctgggttagc tttccacaga cataaatata 1320 
gtccatctgc aatcacaagc acatagccag gtaacccacc aactcccctg gatctgcagc 1380 
ccacacgtgg gagtctggct gtcaccttca caagccacag aaacggccac acatgttcac 1440 
agctcacacg ccctctccat tcatcgaact tctcagtgtc cctgtccctg gtgcctggca 1500 
cagggaacag catgccccct ccggggtcat gccacccaga gactgtcgct gtctatggcc 1560 
ccaactcatg ctccctctct tggctacacc actctcccag cctgtgacca ccgatgtcca 1620 
cacaccccca accacttgtc cacacagcta cccacgtacg acatcgtcct ggctccccag 1680 
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agtatcttcc cactgagaca cgccgccccc acagaggcac agtccccagc cacctctgca 1740 
actgcagccc tcagtcaccc ctttttaagc accctgattc taccaaatgc aaacacatct 1800 
gggtctgcga ttatgcacag agactttgga catacgagga ccctcagacc ggaggaacac 1860 
ctgcccaacc ccaacacgtg cttatgtaac cacgtggaaa gcggcccctg ctgcccctcc 1920 
acacacacat acacactcac tgatctacag cccctgttcg gcgtcagagt ccccactaga 1980 
cccagtggaa ggggttagag accaagtagg ggccagtttc caattcaccc tgtcagggag 2040 
tgagccggat ctgacgttcc ttgtgactta agggtccggc ttgggaatta aagtttgttt 2100 
ctggccttta gcct 2114 

<210> 141 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 241577 .4:200 0MAY1 9 
<400> 141 

ccaggtcgcg ctcccactgc cgagcccgcg agatgctccc cagagctgcc tggagcttgg 60 
tgctgaggaa aggtggaggt ggaagacgag ggatgcacag ctcagaaggc accagtgaga 120 
cctggactcc ttcctccctg cctaactcca gagatagggg caggggtccc ctctgcaggc 180 
tgcccccttt gcccttccat gccaccctgg gcctgttcct accacacttc ccatgttccc 240 
atgatcccac tccttggtcc aaggtcaagt ttctccagga aatggag.tgc gcgggcccgt 300 
ggtgggggga agatgtcccc ctacaccaac tgctatgccc agcgctacta ccccatgcca 360 
gaagagccct tctgcacaga actcaacgct gaggagcagg ccctgaagga gaaggagaag 420 
ggaagctgga cccagctgac ccacgccgaa aaggtggcct tgtaccggct ccagttcaat 480 
gagacctttg cggagatgaa ccgtcgctcc aatgagtgga agacagtgat gggttgtgtc 540 
ttcttcttca ttggattcgc agctctggtg atttggtggc agcgggtcta cgtatttcct 600 
ccaaagccga tcaccttgac ggacgagcgg aaagcccagc agctgcagcg catgctggac 660 
atgaaggtga atcctgtgca gggcctggcc tcccgctggg actatgagaa gaagcagtgg 720 
aagaagtgac ttgcatcccc agctgtctcc ctggagtccg ccctggctgg gac 773 

<210> 142 
<211> 296 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG: 344786 ,4:2000MAY19 
<400> 142 

cccatcttgt ggagcgtact ttcattttca atagagcttt gcttttgttg cttcattctt 60 
tccttgcttt gtgtgttttg tccaattctt tgttcaagac accaagaacc tagacaccct 120 
ccaccagtaa catattctag tgagccagcc aggagtttat tcaggatgat aacatggaga 180 
agcttgagga aattattgaa aaataccctc gtgccttccc tttctggatt gggccctttc 240 
aggcattttt ctgtatctat gacccagact atgcaaagac acttctgagc agaaca 296 

<210> 143 
<211> 1171 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI :414307 . 1 :2000FEB01 
<220> 

<221> unsure 

<222> 574, 583 

<223> a, t, c, g, or other 

<400> 143 

cgggatggga aaactatgcc tggggccgac gctctgcccg gctgctgccg ctgaggaaag 60 

ccgggacgcg gagccccgcc gagagcttct ttgctccgga cgcccctgga cgtggcgggc 120 

agccgcgagg gtaaccacca tgatcccctg ggtgctcctg gcctgtgccc tcccctgtgc 180 
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tgctgaccca ctgcttggcg cctttgctcg cagggacttc cggaaaggct cccctcaact 240 
ggtctgcagc ctgcctggcc cccagggccc acccggcccc ccaggagccc cagggccctc 300 
aggaatgatg ggacgaatgg gctttcctgg caaagacggc caagatggac acgacggcga 360 
ccggggggac agcggagagg aaggtccacc tggccggaca ggtaaccggg gaaagccagg 420 
accaaagggc aaagccgggg ccattgggcg ggctggcccc cgtggcccca agggggtcaa 480 
cggtaccccc gggaagcatg gcacaccagg caagaagggg cccaagggca agaaggggga 540 
gccaggcctc ccaggcccct gcagctgtgg cagnggccat acnaagtcag ctttctcggt 600 
ggcagtgacc aagagctacc cacgggagcg gctgcccatc aagtttgaca agattctgat 660 
gaacgagggt ggccactaca atgcttccag cggcaagttc gtctgcggcg tgcctgggat 720 
ctactacttc acctacgaca tcacgctggc caacaagcac ctggccatcg gcctggtgca 780 
caacggccag taccgcatcc ggacctttga tgccaacacc ggcaaccacg atgtggcctc 840 
aggctccacc atcctggctc tcaagcaggg tgacgaagtt tggctgcaga tcttctactc 900 
agagcagaac gggctcttct atgaccctta ctggacagac agcctcttta cgggcttcct 960 
aatctatgcc gaccaggatg accccaacga ggtatagaca tgccacggcg gtcctccagg 1020 
cagggaacaa gcttctggac ttgggcttac agagcaagac cccacaactg taggctgggg 1080 
gtggggggtc gagtgagcgg ttctagcctc aggctcacct cctctgcctc tttttttccc 1140 
cttcattaaa tccaaagctt tttattcatc c 1171 

<210> 144 
<211> 372 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI: 202943 .2 :2000FEB01 
<400> 144 

gtgtgaaagc cacttattat tccaccaaaa ctgtctgttt ctagaaccta taaaatgtaa 60 
ggctccagga aatccggaaa atggactcct caggtgagat ttatacagta ggtgccgaag 120 
tcacattttc gtgtcaggaa ggataccagt tgatgggagt aaccaaaatc acatgtttgg 180 
agfcctggaga atggaatcat ctaataccat attgtaaagg tatgtttagt aaatttacca 240 
cgtttcttat gtttggcaat cctagaaagg taaggcgtag acatatgaag tgttatgttg 300 
tatcaatgtt ctttgtattt gaactgaggt attaggctca tgtttgaaca tgaccccaca 360 
cacacacaca cc 372 

<210> 145 
<211> 337 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 246194 . 2 : 2000FEB01 
<220> 

<221> unsure 

<222> 73, 82, 105, 126, 154, 334 
<223> a, t, c, g, or other 

<400> 145 

tcctcaaccc accaccgccg cagcactggt gcccccaccc 
ccagccacca ggngctcctg cnctgctgcc tccgccgcac 
gttggngccc ccactcctgc atccaccacc tgcncagtcc 
tcgggctcca ctgccaggtc agatgctgct gagcgggggt 
gccgggcctg cagcccagcg tcatggagga cgacatcctc 
caacacccaa taaagttcct ttttaacaca aaanaaa 

<210> 146 
<211> 866 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 815961 . 1 :2000FEB01 



caggcctccc tccaccacct 60 
caggncctgg ggcagcccca 120 
tggcccgcac aacttccccc 180 
ccccggggcc cggtccccca 240 
atggatctca tctgaatccc 300 

337 
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<400> 146 ' 

gaaaggatgt tgcagtcata ggacctgtgg aagagatggg actgctgcct gtagttgggg 60 
aatgtgtggt gaagggtatg gttagcctgc tgctggtggc cgaggtggtg tgggccacag 120 
gggttctggt gcctgtactg gtgtggttgg gggtgatggt ggtggtagaa gttggggtga 180 
cttcaggata gtgtgtggag gaagtgtgtg aatgtaagga tgtagaggtt ttgcctgtgc 240 
taaatgagct tgcggatggg ctggtcccac tggtggtcac tgtgattggt ggggctgtgt 300 
gggtggaccc tgtgaccttg agcgttgtta gtggaggaat ggtgcctgtt ggcgttgagt 360 
ggatggaggc agaagtggac atctgtgcat gggtaggggt gatgaccttg tgagtacttg 420 
gagtcaccaa ggaggtggag aaaggtggaa tgtgagtggg aagtgtggtc tgagggtgtg 480 
atggggttgg ataggtagtg gtggtctgaa aggatggtgc agtcatagga cctgtggaag 540 
agatgggact gctccctgta ggtggggaat gtgtggtgaa gggtatgggt agcctgctgc 600 
tggtggccga ggtggtgtgg gccacagggg ttctggtgcc tgtactggtg tggttggggg 660 
tgatgttggt ggtagaagtt ggggtgactt caggatagtg tgtggaggaa gtgtgtgaat 720 
gtaaggatgt agaggttttg nctgtgctaa atgagcttnn ggnttggctg gtcccactgg 780 
tggtcagnat cattggtggg gctgtgtggg tggaccctgt ggccttcagc gttgtcngtg 840 
gaggaatggt gcctgttggc attgag 866 

<210> 147 

<211> 1115 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> incyte ID No: LG: 120744 . 1 :2000MAY19 
<220> 

<221> unsure 
<222> 776 

<223> a, t, c, g, or other 
<400> 147 

cgctgcaggg agaacggact ccgggcggag ggcagccaat ccgtttcagc gcaggtcttg 60 
ctcgggttgg gcttgccact gcctggaaca tacctgtccc cctggcgcaa cactcagctg 120 
gctgcgacgg caaccccgag cctggacact gcgccaggaa tcctaaaacc aaaatattag 180 
aacgaaaaca gaaacatggc tcactatatt acatttctct gcatggtttt ggtgctgctt 240 
cttcagaatt ctgtgttagc tgaagatggg gaagtaagat caagttgtcg tactgctccg 300 
acagatttag ttttcatctt agatggctct tatagtgttg gcccagaaaa ctttgaaata 360 
gtgaaaaagt ggcttgtcaa tatcacaaaa aactttgaca tagggccgaa gtttattcaa 420 
gttggagtgg ttcaatatag tgactaccct gtgctggaga ttcctctcgg aagctatgat 480 
tcaggagaac atttgacggc agcagtggaa tccatactct acttaggagg aaacacaaag 540 
acagggaagg ccatccagtt tgcgctcgat tacctttttg ccaagtcctc acgatttctg 600 
actaagatag cagtggtact tacggatggc aaatcccaag atgacgtcaa ggatgcagct 660 
caagcagcaa gagatagtaa gataacatta tttgctattg gtgttggttc aga'aacagaa 720 
gatgccgaac ttagagctat tgccaacaag ccttcgtcta cttatgtgtt ttatgnggaa 780 
gactatattg caatatccaa aataagggaa gtgatgaagc agaaactttg tgaagaatct 840 
gtctgtccaa cacgaattcc agtggcagct cgtgatgaaa ggggatttga tattctttag 900 
gtttagatgt aaataaaaag gttaagaaaa gaatacagct ttcaccaaaa aagataaaag 960 
gatatgaagt aacatcaaaa gttgatttat cagaactcac aagcaatgtt ttcccagaag 1020 
gtcttcctcc atcatatgta tttgtgtcta ctcaaagatt taaagtcaag aaaatttggg 1080 
atttatggag aatattaact attgatggaa ggcca 1115 

<210> 148 
<211> 2448 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI:757520 . 1:2000MAY01 
<400> 148 
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tgagcctcgc actctgccgc ccgcaccacc ttccgctgcc tctcagactc tgctcagcct 60 
cacacgatgt cgtgccgctc ctacaggatc agctcaggat gcggggtcac caggaacttc 120 
agctcctgct cagctgtggc ccccaaaact ggcaaccgct gctgcatcag cgccgccccc 180 
taccgagggg tgtcctgcta ccgagggctg acgggcttcg gcagccgcag cctctgcaac 240 
ctgggctcct gcgggccccg gatagctgta ggtggcttcc gagccggctc ctgcggacgc 300 
agcttcggct accgctccgg gggcgtgtgc ggacccagcc ccccatgcat cactaccgtg 360 
tcggtcaacg agagcctcct cacgcccctc aacctggaga tcgaccccaa cgcacagtgc 420 
gtgaagcagg aggagaagga gcagatcaag tccctcaaca gcaggttcgc ggccttcatc 480 
gacaaggtgc gcttcctgga gcagcagaac aagctgctgg agaccaagtg gcagttctac 540 
cagaaccagc gctgctgcga gagcaacctg gagccactgt tcagtggcta catcgagact 600 
ctgcggcggg aggccgagtg cgtggaggcc gacagcggga ggctggcctc agagctcaac 660 
catgtgcagg aggtgctgga gggctacaag aagaagtatg aagaggaggt ggccctgaga 720 
gccacagcag agaatgagtt tgtcgttcta aagaaggacg tggactgtgc ctacctgcgg 780 
aaatcagacc tggaggccaa tgtggaggcc ctggtggagg agtctagctt cctgaggcgc 840 
ctctatgaag aggagatccg cgttctccaa gcccacatct cagacacctc ggtcatagtc 900 
aagatggaca acagccgaga cctgaacatg gactgcatca tcgctgagat caaggctcag 960 
tatgacgatg ttgccagccg cagccgggcc gaggctgagt cctggtaccg tagcaagtgt 1020 
gaggagatga aggccacggt gatcaggcat ggggagaccc tgcgccgcac caaggaggag 1080 
atcaacgagc tgaacbgcat gatccagagg ctgacggccg agattgagaa tgccaagtgc 1140 
cagcgtgcca agctggaggc tgctgtggct gaggcagagc agcagggtga ggcggccctc 1200 
agcgatgccc gctgcaagct ggctgagctg gagggcgccc tgcagaaggc caagcaggac 1260 
atggcctgcc tgctcaagga gtaccaggag gtgatgaact ccaagctggg cctggacatc 1320 
gagatcgcca cctacaggcg cctgctggag ggcgaggaac acaggctgtg tgaaggtgtg 1380 
ggctctgtga atgtctgtgt cagcagctcc cgtggtggag tctcctgtgg gggcctctcc 1440 
tacagcacca ccccagggcg ccagatcact tctggcccct cagccatagg cggcagcatc 1500 
acggtggtgg cccctgactc ctgtgccccc tgccagcctc gttcctccag cttcagctgc 1560 
gggagtagcc ggtcggtccg ctttgcctag. tagagtcatg gagccagggc ttcctgccaa 1620 
gcacctgcct gcctgcatca ctgcactgaa tggcatgtga atggaaaatg tgtgcttgct 1680 
tccagaatct tctggatgtt cctacagagg ggaagaccta cagagggaaa gaccctcggg 1740 
ccgctcccct gcgccttttc atgctaggga gatgcatcct agttgtcctc ctggcagctg 1800 
ttttcagagg cattcccagc ccttcactta actcctactt agctccaaaa tacctgtatc 1860 
caatttgtat tattccccca gctctcaggg acaagaccag tcccccagcg tggtggtcag 1920 
cacggaagct ccaccttctg ggtggaggcg ccatcctaac catccagcca ggccacccac 1980 
aacccgagaa tcagggagaa agtccctccc cagcagcccc ctcctcctgg ctgggaagaa 2040 
tggtccccca gcaagcactt gcctgttcat tcccgttcat gttttgcttc tctctcagac 2100 
tgccttcctg cttctgggct aacctgttcc aagccaggct cctcaatgtg acctcgcagt 2160 
tgagaagccc attatcgtgg ggcatccttt tgcctacagc ccctggttag ggcactttgg 2220 
acaggtcttg ctattcagtg aacctttgta catttcaaag aagactccat ggctgctcca 2280 
gatgccccct tgctgggtgc aggtggggac tgtccaatgc agagctggcg ggacagagag 2340 
ttaagccact tcctgggtct ccttcttatg actgtctatg gggtgaattg ccttctgggg 2400 
ttgtctcgat ctgtgtttca ataaatgccg ctgcaatgca aaaaaaaa 2448 

<210> 149 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 160570 . 1:2000FEB18 
<220> 

<221> unsure 
<222> 609 

<223> a, t, c, g, or other 
<400> 149 

gcttccccgg cggctcccgg gcgctgactc tcccgggctt ttccgccttt ctcgctcccg 
ccccgccggg cggttcggcg gccggtcccc ggaccccgcc gggagctgct cgtctcaccc 
tcgccacggc gcggacaata aattaatgca tcgcgcttga gtggaggggg cggctgggcg 
cctcggtggt agctttctct cctggctgga gacgaccaca accgacatgg gctgtttctg 
cgctgttccg gaagaatttt actgcgaagt tttgctcctg gatgaatcca agttaaccct 
taccacccag cagcagggca tcaagaagtc aacgaaaggt tccgttgtcc ttgaccacgt 
attccatcac gtaaaccttg tggagataga ttattttggg ctacgttact gtgacagaag 
ccatcagacg tattggctgg atcctgcaaa aacccttgct gaacacaaag aactgatcaa 
cactggacct ccatatactt tgtattttgg tattaaattc tatgctgaag atccatgtaa 
acttaaagaa gaaataacca ggtatagtat tgactttgtt tttgaacaga tacatgcatt 
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gcgtatacna aaagctttat ttaaaacaaa ttaaaattgg gctaaacgtt 650 

<210> 150 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 35 0398 .3 :2000FEB01 



<220> 

<221> unsure 

<222> 22, 117, 120, 220, 246 
<223> a, t, c, g, or other 



<400> 150 

cagctgctcc tgacactccc anaactgtcg ccagcgggga ggggaggcag tagggactgc 60 
cgaagcctgg ggccctggtg ggtgacaggg cagcaccagc ctgggtccag ccacctntcn 120 
cacaggtgaa ccgcttccac aagatccgga accgggccct cctgctcaca gaccagcacc 180 
t'ctacaagct ggaccctgac cggcagtacc gggtgatgcn ggccgtgccc cttgaggcgg 240 
tgactngctg agcgtga 257 

<210> 151 
<211> 1554 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID'No: LI : 221285 . 1 :2000FEB01 



<400> 151 

gcctggctgc 

aggagaaaca 

ccaacaccag 

cgcaggcctc 

caaagcgcag 

cttgtgccgc 

cagacagaac 

ctgggagcga 

ctcccttgtt 

tctccgggtc 

gaggcagtca 

gcacctacac 

acagcctgac 

gggtaattcg 

ctctagagct 

aagagcagct 

aaacagaatt 

aaacagatgc 

aagatgtggc 

cagtgttaga 

aagaagagct 

aacgtgctga 

ggcgtaagaa 

ccgcccgagc 

tcagggatag 

catggctaat 



caggggcgtg 
gccgaggaca 
ccctgcaaag 
aaggatgacg 
gcgcacctac 
agtccttccc 
cctgggttgc 
gcccagtata 
gtcgcctctc 
ccacaacacc 
gcccccggac 
ctacaccagc 
gcagccagat 
aggtaacact 
ccagagacaa 
cagaccacaa 
ataccttgaa 
atttttggac 
cacccaaata 
agttttggtg 
ggctaacctg 
agttcaacga 
acagcagtgg 
atttgcacag 
tggctacttt 
gaatgaagtt 



ttatcgcggg 
ccctggaaga 
ccacattcca 
gtcaagccag 
ctgggtggcg 
gggaaacctg 
tgggcaaccg 
gctcccacct 
ctcgcgtcgc 
ttagcgccca 
atggcctcag 
cggccccgag 
gaagaaccta 
tatgcactcc 
cgggaggcta 
acacctgaac 
gaaattgctg 
agaccaccaa 
ctagaaggag 
gggaagacaa 
cgggccagtc 
cttgaagagc 
gaaataatgc 
cgttacctgg 
tatgatccca 
gaaaaaacca 



ggcagcctgg 
gcaggagcac 
aagccgggcc 
ccaaggccgc 
cggctgggcg 
gggaccggaa 
atgcggcccc 
cctgcgctgg 
ggaatccttg 
cctgcttcaa 
cgctgactga 
cactgccctg 
tgcattatgg 
agacagggcc 
ggaagagggc 
ctgtggaagg 
atcgcataat 
caccactctt 
agctctttga 
ttgagcagtc 
agcgtgagta 
aagagaggcg 
acaagcacaa 
ctgaccttct 
ttgaaagaga 
tggaatatag 



gagcgctgct 
gtgggcctgc 
aaggacagcc 
ctccctagcc 
gagtcaagag 
ctgcccggag 
cgcagctggt 
taacctcccg 
cccttggcac 
agccaagctc 
tcgcacctct 
ccagcgcagc 
aaacataatg 
actgctcgga 
tcttgccaga 
cagaaagcat 
agaagttgat 
tattcctgcc 
ctttgatctt 
tcttctggaa 
tgaagaacta 
acaccgagaa 
cgagacatca 
cccgtctgtt 
tattgagata 
catggtggga 



gcgccgcggc 
gagaatggcg 
accgttgccc 
aggaacctcg 
ccggaagtcc 
ttcccgcccc 
ctctgtggtg 
tcccgcccgc 
tacttacatc 
caccgaaagc 
cgggccccga 
cgttaccggg 
tatgacagaa 
cggcctgatt 
aaacaagccc 
gtcgatgtgc 
atggaatgcc 
aaaactggca 
gaagttaaac 
gtaatggaag 
cggaatagtg, 
gaaaaagaac 
caaaaaatcg 
tttggcagcc 
ggatttcttc 
agaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1554 



<210> 152 
<211> 623 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
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<223> Incyte ID No: LI : 401605 . 2 :2000FEB01 
<400> 152 

ctggcagggc tggaacatga agaaagagaa gaattttatt tatctacgcc agaaaactac 60 
cactacttga atcagtctgg atgtgtagaa gacaagacaa tcagtgacca ggaatccttt 120 
agggaagtta ttgtaagtta cttttaaatg ttttccctat taagatatca gataccttat 180 
taagagtaac agatttcatt ttcggtttca aaagattttt ttaaatagaa atagaagtaa 240 
ttccatttgg accagtgggt gagaacaggt atttccttct tgttccaagt catgaacttg 300 
ttccaagttc taggctgaaa ggggtttgag gactcttctt tccttagaag atgagcgaat 360 
gtgtcattca gggcagatgg gcagtttact tgggacagtg tgttcagaaa gtgtaccttc 420 
cacccccaaa aaacccccca aatcatgggc ttcactctgg aaccagcaga tcctccattt 480 
tggggcttac aagtaagatt tctttgaaaa acaaattcta tagctaaaaa caaatctgcc 540 
agccattggc ccaggtgact gttgttgcag aattttggct ctttgttcag ctaaaacctg 600 
ggggtcttgt cacacgatca gga 623 

<210> 153 

<211> 1038 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 329017 . 1 :2000FKB01 
<400> 153 

ctgggctgga ggctctagca gttgttgccc acctggctgt ccatggcaaa agtgggtggg 60 
gtcatgtgca gcctcctgcc acctgagtgt ttcccgggac atcaggaggc tgtatcctgc 120 
agctgaattt acacagaagt ggaacaactg ggctggaagc tctatggcga caaattggct 180 
acttcaagtg gtgacactac agttaaatta tgggatctat gtacaggcga ttgcattttg 240 
acctttgaag gacacagccg cgcagtgtgg tcctgcacat ggcactcctg cggcaatttt 300 
gtggcttcct cctcactgga taaaactagc aaaatttggg atgttaatag tgaaagatgc 360 
agatgtactt tgtatggaca tacagattct gtgaacagca ttgagttttt tcctttctcc 420 
aatactcttc tcacaagctc tgcagacaag accctgtcta tatgggatgc aagaacaggt 480 
atatgtgagc agtcacttta tggtcacatg cattctatca atgatgccat ttttgatccc 540 
aggggtcaca tgatagcatc ctgtgatgcc tgtggggtta caaagctgtg ggactttcgg 600 
aagctgttac caattgtgtc catcgatata ggtccaagtc ctggcaatga ggtgaatttt 660 
gattcatcag gtcgagtttt agctcaggca agtggcaatg gtgttatcca tttgctagat 720 
cttaaatctg gggagattca caaattgatg ggccacgaaa acgaggcaca cacggttgtg 780 
ttttctcacg acggggagat tctcttttct ggaggctctg acggcacagt tcgaacgtgg 840 
tcttgaccgt cagcacatcc cgctgcagag ggcattccct ttaaggcttg aaaatgcctc 900 
ctgtagtccc aaaccagcaa agaactggtt aaatgtgcca gcaggtaaca tttacagtct 960 
gctttaaaac atgctttgga catatatttt agtgctcata ctgttactaa taaaaaaata 1020 
actttatgct caaaaaaa 1038 

<210> 154 
<211> 737 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 401322 . 1:2000FEB01 
. <220> 

<221> unsure 

<222> 601, 625, 633, 698, 700, 711 
<223> a, t, c, g, or other 

<400> 154 

gcgacccaat gggagtcccc gccaggagac acaaccggcg gctctccggg gatttataag 60 
gtcccgccag caaccgcgcg ttgggattcg tggtgttgct ggagacccgt atggtgtctg 120 
cagtcagcga ccctccaact tcaactactg gatcaaccat gtgtgagtgc atctccatcc 180 
atgtgggcca ggccggagtg cagatgggca atgcctgctg ggagctgtac tgcctggagc 240 
acgacatcca gcccagtggc accatgccca gccacaaggc cctggggagc agtgataact 300 
ccttcaacac cttcttcagg gagacccagc ctggcaggca tgtgtcctgg gctgtctgtg 360 
gacctggagc ctgctgtcat aggttggcat caactaccag tcccccacag tggtgcccgg 420 
gggtgctgta gccaaggtgc agcgggcagt ctgcgtgcta aacaatacca cagccatcac 480 
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tgaggcctgg ggcccgcctc aaccaaaagt ttgacctgat gtatgccaag cgggcattta 540 
tgcactgtta tgtggacagg ggcatggagg aaggtgagct ctctgggcct gagaggacct 600 
ngctgccctg gagaaggatt atgangtagg canagaatct ttggacagcg aggatgagca 660 
tgaggagttc taggggtcct ggcacctgta acacctgntn tctcgcacca nagttgtgtg 720 
tgtatctcat tccactg 737 

<210> 155 

<211> 2275 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 403409 . 1 :2000MAY19 
<400> 155 

gtggccgcgg cagacggaag ccgaacgagt tcctcggcgg ctgcaggatg ggggactcca 60 
aagtgaaagt ggcggtgcgg atacgaccca tgaaccggcg agagactgac ttgcatacca 120 
aatgtgtggt ggatgtggat gcaaacaagg ttattcttaa tcctgtaaat acgaatcttt 180 
ccaaaggaga tgcccggggc cagccgaagg tgtttgctta tgatcattgt ttctggtcta 240 
tggatgaatc tgtcaaagaa aagtatgcag gtcaagatat tgttttcaag tgccttggag 300 
agaatatcct gcagaatgct tttgatggct acaatgcatg tatctttgcc tatggacaga 360 
ctggctctgg aaaatcttat accatgatgg gcacagctga ccaacctgga ttaatcccaa 420 
gactttgcag tggactcttt gaacgaactc agaaagagga aaatgaagaa eagagtttta 480 
aagtagaagt gtcctacatg gaaatttata atgaaaaagt tcgagacctt cttgatccca 540 
aaggaagccg tcagacgttg aaagtcagag agcatagtgt gttgggacct tatgtcgacg 600 
gactttctaa actggctgtc acaagctaca aggatattga gtcgttgatg tctgagggta 660 
acaaatctcg cacagttgct gcaaccaaca tgaacgagga gagtagccga tcccatgcag 720 
ttttcaaaat caccctcaca catactctct acgatgtgaa gtctgggaca tctggagaga 780 % 
aagtgggcaa actcagcctg gtggatttag ctggcagtga acgagcaacg aagacaggc§ 840 
ctgcagggga caggctgaag gaagggagca acattaacaa gtccctcaca accctcggtc 900 
tggttatctc agctcttgca gatcagagtg ctggcaaaaa caagaataaa tttgttccat 960 
atcgtgactc agttctcact tggctgctca aagacagcct cgggggtaac agcaagaccg 1020 
ccatggtggc tactgtgagt cctgcagctg ataactatga tgaaaccctc tcaactctgc 1080 
ggtatgcaga tcgagccaag cacattgtaa accacgctgt ggtgaatgag gaccctaatg 1140 
cccgaattat ccgggatctc cgggaagaag ttgagaaact ccgggagcag ctgaccaaag 1200 
cagaggcaat gaaatctcca gagctaaagg accggctgga agaatctgag aagctaatcc 1260 
aggaaatgac tgtgacctgg gaggagaaat taaggaaaac ggaggagatt gcacaggaac 1320 
gacagaaaca gcttgagagt cttggaatat ctcttcagtc ttcgggaatc aaagttgggg 1380 
atgataaatg cttccttgtg aatctgaatg ctgacccagc tctgaatgag cttctggtgt 1440 
actatttaaa ggaacataca ttgatagggt cagcaaattc ccaagatatc caactgtgcg 1500 
gcatgggaat tcttcctgaa cactgtatta tagacatcac gtcagaaggc caggttatgc 1560 
tgactcctca gaagaacacc agaacatttg taaatgggtc atctgtctcc agtccaatac 1620 
agctacacca tggggacagg atattatggg gaaacaatca tttcttcaga ctcaatttgc 1680 
ctaaaaagaa aaagaaagca gaacgagagg atgaggacca ggatccctcc atgaagaacg 1740 
agaatagttc tgagcagctg gatgtagacg gagactcctc cagcgaggtg tccagtgaag 1800 
ttaactttaa ttacgaatac gcacagatgg aggtcaccat gaaggccctg ggcagcaatg 1860 
atccgatgca gtccatatta aacagcttag aacaacagca tgaagaagaa aaacgatctg 1920 
cactggagcg ccagaggctt atgtatgagc acgaattgga gcagctccgg agaaggctgt 1980 
ctcctgagaa gcagaactgc cggagcatgg acaggttttc tttccactcg cccagcgctc 2040 
agcaacgctt aagacagtgg gctgaggaga gagaagcaac gttgaataac agcctgatga 2100 
ggctgaggga acaaattgtt aaggccaatc tattggtgag agaagctaat tacrattgctg 2160 
aggagctgga taaaagaaca gaatacaaag ttaccctaca gattccagcc tccagcctgg 2220 
atgccaacag gaagcgaggc tctcttctta gtgagcctgc aatccaggtg agaag 2275 

<210> 156 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:233933 . 5 :2000MAY19 
<400> 156 

ctccacgcca gcacttttgc atgcagtact cccttcaggt cgtcacagcc acctgcaagt 60 
ttgggatgaa tgcgcttctc ctgtctgcct ggttcggcca cttacgaatc ctccagatct 120 



75/228 



WO 01/62927 



PCT/DS01/06059 



tggtaaactc aggggccaag atccactgtg agagcaagga ggggaacact gcccttcatc 

tggctgctgg tcggggccat atggctgtgc tgcagcgact tgtggacatc gggctggacc 

tggaggagca gaatgcggaa ggtctgactg ccctgcattc ggctgctgga ggatcccacc 

ctgactgtgt gcagctcctc ctcagggctg ggagcaccgt gaatgccctc acccagaaaa 

acctaagctg ccttcactat gcagccctca gtggctcgga ggatgtgtct cgggtcctca 

tccacgcagg aggctgcgcc aacgtggttg atcatcaggg tgcctctcct ctgcacctcg 
ctgtgagg 

<210> 157 
<211> 5106 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 290344 . 1:200 0MAY01 
<400> 157 

gggaagtggg agagacagag ggagagtgtg catacagcag ctgtcgggca tccctgtctc 60 
agggacttct tcgctgattc acagaggcag cccagccccg gcctcccagc ttacagctgc 120 
cgcctgcttc tcagaacctg acggaatcag aagctacatg atattcgccg tgtgagtggc 180 
tatcgtaata gataacattt gaatatctac taaatgccag aaactatgaa ttaactcctc 240 
tcaacaaagc taagaggtgc ttgatgatca atagttggtg cactgcctat cctcatcccc 300 
cccatgagcc ttggttttta tgcaacctat ggtcatgcag ggatgccc.tt acaccctccc 360 
acgatgtcat gactggcagg cagctgacca gttccatcat agcagcagcc tccgaagcac 420 
ctgcccccac cctcaggtta gagctgctgt caccagccct gcacctcctc aagatggtgc 480 
tggggttccc tgcctaagcc taaagctgtt gaacgggtct gttggtgcct ctggacccct 540 
ggaaccacca gccatgaatc tgtgttggaa tgaaataaaa aagaagtctc acaacctccg 600 
cgctcgccta gaggccttct cagaccacag tggaaagctt cagctccctc ttcaagagat 660 
tattgactgg ctcagccaaa aggatgagga gttgtcagct cagctgcccc tacaggggga 720 
tgtggccctg gtgcaacagg agaaggagac acatgcggcc tttatggaag aagtcaagtc 780 
tcgggccccc tacatctatt ctgtgctgga gtcagctcag gccttcctgt cccagcaccc 840 
atttgaggag ttagaggagc ctcattctga gagcaaagat acctccccga aacagcggat 900 
ccagaatctc agccgctttg tatggaagca ggcgacggtg gccagtgaac tgtgggagaa 960 
gttgacagcc cgctgtgtgg accagcaccg tcacattgag cggactctgg agcagctctt 1020 
ggagattcaa ggggcaatgg aggaactaag cactactcta agccaagctg agggagtccg 1080 
agccacttgg gagcccattg gggatctctt cattgattca ctcccagagc acatccaggc 1140 
tattaagctg ttcaaagaag aattctcccc catgaaagat ggagtaaagt tggtgaatga 1200 
tctggcccac caacttgcca tttctgatgt gcacttgtca atggagaatt cccaggccct 1260 
ggaacagatc aacgtccgat ggaaacaact acaggcgtca gttgatgaga ggcttaagca 1320 
gctccaggat gcccaccggg actttgggcc tgggtcacag cactttctct cctcctctgt 1380 
ccaggttccc tgggaaagag caatttcacc caataaagtt ccctactaca tcaaccacca 1440 
ggctcagacc acatgctggg accatcccaa gatgacagag ttataccaaa ccctagctga 1500 
tctgaacaac attaagttct cage t tat eg cactgccatg aaactccgca gagtccagaa 1560 
agccctgcgc ttggacctgg taactttaac cacagccctg gaaatcttca atgagcatga 1620 
tctgcaggcc agtgagcacg tgatggatgt ggtagaggtc attcactgcc tgactgeett 1680 
atatgaacgt ttggaggagg aaagaggcat cctggtcaat gtgccactct gtgtggacat 1740 
gagectcaat tggctcctca atgtttttga tagtggtcgc ageggaaaga tgegggcatt 1800 
gtcttttaag actggcattg catgcttgtg tggcacggaa gtgaaggaaa aacttcagta 1860 
cctcttcagc caagtggcca actcaggcag ccagtgtgac cagcgccacc ttggtgtcct 1920 
gcttcatgag gecattcagg tgccccgtca gctgggtgaa gtggcagcct ttgggggcag 1980 
caatgtggag cccagtgtcc gtagttgctt ccgttttagc acegggaage cagtcattga 2040 
agcatcccag ttcctggagt gggtcaacct ggagccccag tccatggtgt ggctgcctgt 2100 
tetgeategg gtaaccattg ctgagcaagt gaagcatcag accaagtget ctatctgtag 2160 
cgcagtgccc catcaagggg ttcaggtacc ggagtctgaa gcaattcaac gttgacatct 2220 
gccagacctg cttcttgaca ggcagggeca gcaaaggcaa ttaagctgea ctaccccatc 2280 
atggcagtat tacacaccga ccacatccag tgagaacatg agggactttg ccacaacctt 2340 
aaagaacaaa ttccgctcca agcattattt cagcaaacac cctcagcgag gttatctgee 2400 
tgtgcaatca gtgctggagg ctgactacag tgagacgccg gcttcttccc cgatgtggcc 2460 
acacgccgac acacactccc gaattgagca ttttgecage aggcttgetg agcatggaaa 2520 
gtcaaaattg ctccttcttt aatgaaaggt tgttccccaa ttgaaagctt aaacgaggac 2580 
cagtacctgc tgcggcactc cagccccatc acagaceggg agccagcctt tggacagcag 2640 
gctccatgca gtgtgggcac agaaagcaaa ggggagctac agaagatcct ggcccacttg 2700 
gaagatgaga aceggattet ccagggagag ctgaggcgcc tgaagtggca gcatgaggag 2760 
gcagctgagg cacccagtct ggctgaegge tccactgagg cagcaacaga ccaccgcaat 2820 
gaggagcttc tggecgagge ccgtatcctt eggcaacata agagccgcct ggagacgege 2880 
atgeagatec tcgaagatca caacaagcag ctagagtccc cagctgcagc gtcttgaggg 2940 



180 
240 
300 
360 
420 
480 
488 
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agcttcttcc tgcagccacc caccgaatca gatggcagtg gctctgcagg ctcgtcccta 3 000 
gcttcctctc cacagcagtc agaaggccag tcacccccgg gagaagggac agactactcc 3060 
agataccgag gctgcatgat gatgtggggt caaagagcca ggatgtcagc ctgtgcttgg 3120 
cggacatcat ggagaaactc cgtcatgcct tcccccagtg tgcgaagttc tgatgtgact 3180 
gcgcaacacc ctgctggcct cttgatggag ccagatcccc atcctatagt tcatagtcct 3240 
ctcctggttc cgggtcaaag cctttcctca aggcttcacc caacctttcc cagtttccac 3300 
ctggccccac attcctcaac tagtattatt tgggcatctg ggcagcagca gggatctggt 3360 
ggtatgtgag gtgcatgcgg gcagtgatgg gagaagggga gcgcatgatt ctttcttgac 3420 
cctagaaagt gttcccttta atcttcaagt tcgagatcag ccctttaagt acctttctgt 3480 
tgcagcccag ggcaaatatc acttggccat ttcagatggg gagcagagaa gcctgccttg 3540 
cagaagctaa gcggtacatg ttggcccctc ttccttctcc gtgggaatta tggaaagaga 3600 
gaagcctaat taccgccaag gttccatcca gcattatgaa tccaacaggg ttttacccct 3660 
actctctggc gcaactcagg gaggcaaaag ggtatccccc gagacacctt ccccaccccc 3720 
aagtggcccc tgaaaaagtc tgggcagtgt gtgtactatt aacttcagct ccagggacag 3780 
gattgctagc catcctaatg ctacctaggt ataggtttct cctcaaatat cttcattttc 3840 
tttagcctca gtccctgaca gaccttcctt aagggggaga acagttcatt ctgggttctt 3900 
ctaccgttat atttggaagc attaaatcca tacttatggg tttagaatgt ttgtttttgc 3960 
cggcacatgt gtcctatttc gttaaagtcc tcagcatttt gagaggagcc cactgagttg 4020 
cgagttgctc tggaaacctg gctgaagcca ccc ttgtgct cc tctgagtc tgtgcctcag 4080 
cagacctgag gacacagtgg ccagcttgcg ttgagcaagg gagtgtggct ttcctgtggg 4140 
taaggaagag gaagcccacc tcaagtcttg gaagtagtta tgacctgcca tgcccacaac 4200 
cgcttcccaa ggctgtgacc cctttctagc ttctcaggct gaatcaggtg aaggtggctg 4260 
atttgtcccc agctaatcat aaggcatttg caactggcta ccgtgtgtag tggcccctcc 4320 
ataggagcca caggccacaa atcaggaagc gtggccactt tccagccaga caacaaca.gc 4380 
aaggccttcc tgtgcctaaa attgttattt ctaactctgg aaggtgaatc caatcctcaa 4440 
actgaactag ctcttcctag gcgacatata ccatgatctc tttatccaga aactgttcag 4500 
aatagatagt aggggtgggg gagtgggctt gggaatgttg aatagattca gagcccactt 4560 
aggcagcttt ctcccaagag gagctttata acctagagcc ctgcagttga cttcatttgc 4620 
ctaagtttat atacatacat atctattcta catatctact gtacatcagt gctctcctgg 4680 
cctcactcac acattaaatt ctaatacctc ttgaagtatt aatgttctgc cttgctcttt 4740 
gttagcccca gcctctggtt ggaaagatag atgtatcctg gctgctcccc taggatatct 4800 
ctctccattg atgtttggcg gtgggctgtc ctcaagacca aagcaacagg aaacctgtgg 4860 
tcacagagaa tctaacagga agagtcactg ttctcttcac tcattccaaa tcttgttccc 4920 
ttgagacctg ggccttgggt gaattttcct gggctcccca taaatggttc aggtgccaac 4980 
aatagagcaa acccaagctc tgcccacccc atgcagcaaa aggacaaaca ggccttcatt 5040 
agcctttgcc agctaagaca cagagaacat cctctggctc ccttccagac tcttatggtc 5100 
tcactc 5106 

<210> 158 

<211> 3800 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID*No: LI: 410742 . 1:2000MAY01 
<220> 

<221> unsure 
<222> 3787 

<223> a, t, c, g, or other 
<400> 158 

agaccatctg tagactcctt cggaaagcag cagagatgct gcagagggtt tttcttagac 60 
attaactgca gacggctgtc aggatagaag cagcggctca cttggacttt ttcaccaggg 120 
aaatcagaga caatgatggg gctcttcccc agaactacag gggctctggc catcttcgtg 180 
gtggtcatat cggttcatgg agaattgcga atagagacta aaggtcaata tgatgaagaa 240 
gagatgacta tgcaacaagc taaaagaagg caaaaacgtg aatgggtgaa atttgccaaa 300 
ccctgcagag aaggagaaga taactcaaaa agaaacccaa ttgccaagat tacttcagga 360 
tacccaacca acccagaaaa tcacctaccg aatctctgga gtgggaatcg atcagccgcc 420 
ttttggaatc tttgttgttg acaaaaacac tggagatatt aacataacag ctatagtcga 480 
ccgggaggaa actccaagct tcctgatcac atgtcgggct ctaaatgccc aaggactaga 540 
tgtagagaaa ccacttatac taacggttaa aattttggat attaatgata atcctccagt 600 
attttcacaa caaattttca tgggtgaaat tgaagaaaat agtgcctcaa actcactggt 660 
gatgatacta aatgccacag atgcagatga accaaaccac ttgaattcta aaattgcctt 720 
caaaattgtc tctcaggaac cagcaggcac acccatgttc ctcctaagca gaaacactgg 780 
ggaagtccgt actttgacca attctcttga ccgagagcaa gctagcagct atcgtctggt 840 
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tgtgagtggt gcagacaaag atggagaagg actatcaact caatgtgaat gtaatattaa 900 
agtgaaagat gtcaacgata acttcccaat gtttagagac tctcagtatt cagcacgtat 960 
tgaagaaaat attttaagtt ctgaattact tcgatttcaa gtaacagatt tggatgaaga 1020 
gtacacagat aattggcttg cagtatattt ctttacctct gggaatgaag gaaattggtt 1080 
tgaaatacaa acgtgatcct agaactaatg aaggcatcct gaaagtggtg aaggctctag 1140 
attatgaaca actacaaagc gtgaaactta gtattgctgt caaaaacaaa gctgaatttc 1200 
accaatcagt tatctctcga taccgagttc agtcaacccc agtcacaatt caggtaataa 1260 
atgtaagaga aggaattgca ttccgtcctg cttccaagac atttactgtg caaaaaggca 1320 
taagtagcaa aaaattggtg gattatatcc tgggaacata tcaagccatc gatgaggaca 1380 
ctaacaaagc tgcctcaaat gtcaaatatg tcatgggacg taacgatggt ggatacctaa 1440 
tgattgattc aaaaactgct gaaatcaaat ttgtcaaaaa tatgaaccga gattctactt 1500 
tcatagttaa caaaacaatc acagctgagg ttctggccat agatgaatac acgggtaaaa 1560 
cttctacagg cacggtatat gttagagtac ccgatttcaa tgacaattgt ccaacagctg 1620 
tcctcgaaaa agatgcagtt tgcagttctt caccttccgt ggttgtctcc gctagaacac 1680 
tgaataatag atacactggc ccctatacat ttgcactgga agatcaacct gtaaagttgc 1740 
ctgccgtatg gagtatcaca accctcaatg ctacctcggc cctcctcaga gcccaggaac 1800 
agatacctcc tggagtatac cacatctccc tggtacttac agacagtcag aacaatcggt 1860 
gtgagatgcc acgcagcttg acactggaag tctgtcagtg tgacaacagg ggcatctgtg 1920 
gaacttctta cccaaccaca agccctggga ccaggtatgg caggccgcac tcagggaggc 1980 
tggggcctgc cgccatcggc ctgctgctcc ttggtctcct gctgctgctg ttggcccccc 2040 
ttctgctgtt gacctgtgac tgtggggcag gttctactgg gggagtgaca ggtggtttta 2100 
tcccagttcc tgatggctca gaaggaacaa ttcatcagtg gggaattgaa ggagcccatc 2160 
ctgaagacaa ggaaatcaca aatatttgtg tgcctcctgt aacagccaat ggagccgatt 2220 
tcatggaaag ttctgaagtt tgtacaaata cgtatgccag aggcacagcg gtggaaggca 2280 
cttcagggaa tggaaatgac cactaagctt ggagcagcca ctgaatctgg aggtgctgca 2340 
ggctttgcaa cagggacagt gtcaggagct gtttcaggat tcggagcagc cactggagtt 2400 
ggcatttggt ccctcaggcc agtctggaac catgagaaca aggcattcca ctggaggaac 2460 
caataaggac tacgctgatg gggcgataag catgaatttt ctggactcct acttttctca 2520 
gaaagcattt gcctgtgcgg aggaagacga tggccaggaa gcaaatgact gcttgtggat 2580 
ctatgataat gaaggcgcag atgccactgg tgggtgctgt gggctccgtg ggttgttgca 2640 
gttttattgc tgatgacctg gatgacagct tcttgggact cacttggacc caaatttaaa 2700 
aaacttgccc agagataaag ccccttttgg ttgtttttga aattgggttg aaggcaaaga 2760 
agttcagccc ccacccccct tgctaaagac cagcggttat gggattgaga tcctgtgggg 2820 
catcccatat aaaagtccag cagacaggat ttggtaagtt gccaaacttt gtcaggaagt 2880 
ctaagggagc ttcgtgtttg gtccgcctct gggtcctgtc cagccaagct ggtttcccaa 2940 
ttcccttaac cttctggagc atggtaacta ctttagttaa cgggagattt atctccggct 3000 
tcttggttcc ctcggcgcaa ccgttccagc tgcaggcttt tgatcccact gtctccacac 3060 
gagaatgtga aaagtgccag aaaaagggtg actctgtcgc cagtttccca gtgtctccgt 3120 
ggccaaccta agctgggccc aacgcagcta cgaggcgtca cattactatg ctccggtaca 3180 
gaggatcctt gctcccgtct aatatgaacc cagaatgagc gtggagatat ccacactgga 3240 
gccaaattct ggggatcttg tgggagctaa agtattcaaa catatgcata gcaaacgctc 3300 
acgtgtattt gggctaataa tttggccact tattagcttc tctcataaaa tggattcacg 3360 
attataaatt aaatgttctg ggttcatacc cgcaaaagcc ccataatgtt gtcactccct 3420 
aattctcaca ggtacttatt caaacttgtt agtaaatcct taaaggtttt tcacaaaccc 3480 
ctcaaatcta tattccgcca cgcgaaattg ttccgtaaac agttcttaag cccttctcat 3540 
ttttcccccc tgccccaacg gaaggtgttt taatcacttt taaaaatgca aatagtgaat 3600 
tttagtggga gttaaggcag gtagcgctgg gttcattgtc gtccaattgc ctgtttctgt 3660 
atatgcagtt gatacatgat ggtaaggaat caacaaaggg gcccgggcgc cggtggctca 3720 
cagccatgta atcgccagca cttgtgggag gccagaggca aggtggatct atgaggtcta 3780 
ggagatncga gaccatcctg 3800 

<210> 159 

<211> 2281 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:406568 . 1 :2000MAY19 
<220> 

<221> unsure 

<222> 446, 875, 1361 

<223> a, t, c, g, or other 

<400> 159 

cagcctgcca cttgcctccc tgcctgcttc tggctgcctt gaatgcctgg tccttcaagc 60 
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tccttctggg tctgacaaag cagggaccat gtctaccttt ggctaccgaa gaggactcag 120 
taaatacgaa tccatcgacg gaggatgaac tcctcgcctc cctgtcagcc gaggagctga 180 
aggagctaga gagagagttg gaagacattg aacctgaccg caaccttccc gtggggctaa 240 
ggcaaaagag cctgacagag aaaaccccca cagggacatt cagcagagag gcactgatgg 300 
cctattggga aaaggagtcc caaaaactct tggagaagga gaggctgggg gaatgtggaa 360 
aggttgcaga agacaaagag gaaagtgagg aagagcttat ctttactgaa agtaacagtg 420 
aggtttctga ggaagtgtat acagangagg aggaggagga gtcccaggag gaagaggagg 480 
aagaagacag tgacgaagag gaaagaacaa ttgaaactgc aaaagggatt aatggaactg 540 
taaattatga tagtgtcaat tctgacaact ctaagccaaa gatatttaaa agtcaaatag 600 
agaacataaa tttgaccaat ggcagcaatg ggaggaacac agagtcccca gctgccattc 660 
acccttgtgg aaatcctaca gtgattgagg acgctttgga caagattaaa agcaatgacc 720 
ctgacaccac agaagtcaat ttgaacaaca ttgagaacat cacaacacag acccttaccc 780 
gctttgctga agccctcaag gacaacactg tggtgaagac gttcagtctg gccaacacgc 840 
atgccgacga cagtgcagcc atggccattg caganatgct caaagtcaat gagcacatca 900 
ccaacgtaaa cgtcgagtcc aacttcataa cgggaaaggg gatcctggcc atcatgagag 960 
ctctccagca caacacggtg ctcacggagc tgcgtttcca taaccagagg cacatcatgg 1020 
gcagccaggt ggaaatggag attgtcaagc tgctgaagga gaacacgacg ctgctgaggc 1080 
tgggatacca ttttgaactc ccaggaccaa gaatgagcat gacgagcatt ttgacaagaa 1140 
atatggataa acagaggcaa aaacgtttgc aggagcaaaa acagcaggag ggatacgatg 1200 
gaggacccaa tcttaggacc aaagtctggc aaagaggaac acctagctct tcaccttatg 1260 
tatctcccag gcactcaccc tggtcatccc caaaactccc caaaaaagtc cagactgtga 1320 
ggagccgtcc tctgtctcct gtggccacac ctcctcctcc ncgggactcc tccactccca 1380 
gagaaaaagc tcattaccag aaacattgca gaagtcatca aacaacagga gagtgcccaa 1440 
cgggcattac aaaatgrgaca aaaaaagaaa aaagggaaaa agg.tcaagaa acagccaaac 1500 
agtattctaa aggaaataaa aaattctctg aggtcagtgc aagagaagaa aatggaagac 1560 
agttcccgac cttctacccc acagagatca gctcatgaga atctcatgga agcaattcgg 1620 
ggaagcagca taaaacagct aaagcgggtg gaagttccag aagccctgcg ataaaaacat 1680 
gatctttaga agaggatgca gaactgttca gtggtattac atgaaatgca ttgtgagatg 1740 
tttctaaaat accttcttca attcaaaatg atccctgact ttaaaaataa tctcacccat 1800 
taattccaaa gagaatctta agaaacaatc agcatgtttc ttctgtaaat atgaaaataa 1860 
atttcttttt tatgtcgtga gatttgtatt ggcaagaagc agttaattta aagatgctct 1920 
tcctatctgt ggatgtgttg gtaactccga gttgtaatga gttcatgaaa tgtgctgtta 1980 
tttttgtaat ctcaataaat gtggattgaa gttttttccc tttttttaaa gccaaactaa 2040 
tatttttctg tgacttgata catctgtcag atttttgtaa tctcgataaa tgtgtattga 2100 
agttttttcc ctttttttaa aaagccaaac taatattttt ctgtgagtta atacatctgt 2160* 
caggtgtgta tgtaacatta ctggacatta aaaaaaatta ttacattctc acccaaaaag 2220 
ggtttgggtc ctaagcattt gcctttcttt gtttcctctt gttcaagaaa atc.tgattag 2280 
a 2281 

<210> 160 

<211> 5408 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI : 283762 . 1 :2000MAY01 
<220> 

<221> unsure 
<222> 4004 

<223> a, t, c, g, or other 
<400> 160 

gcgagcgcgg tcgagatcgc catgcctacc cgagtatgct gctgctgttc cgctttgcgt 60 
cctcgctaca aacgcctggt ggacaacata ttccctgaag atccaaaagt aatttgatct 120 
acatctactg atccttctct ttgctgaccc attctgcccc ccgctgacct ccttaaggat 180 
ggccttgtga aaactgatat ggagaaattg acattttatg cagtatctgc tccagagaaa 240 
ctggatcgaa ttggttctta cctggcagaa aggttgagca gggatgttgt cagacatcgt 300 
tctgggtatg ttttgattgc tatggaggca ctggaccaac ttctcatggc ttgccattct 360 
caaagcatta agccatttgt agaaagcttt cttcatatgg tggcaaagct gctggaatcg 420 
ggggaaccaa agcttcaagt tcttggaaca aattctttgg tcaaatttgc aaatattgaa^ 480 
gaagacacac catcctactc acagacgtta tgactttttt gtgtctcgat tcagtgccat 540 
gtgccattcc tgtcatagtg atccagaaat acgaacagag atacgaattg ctggaattag 600 
aggtattcaa ggtgtggttc gcaaaacagt caacgatgaa cttcgggcca ccatttggga 660 
acctcagcat atggataaga ttgttccatc cctcctgttt aacatgcaaa agatagaaga 720 
agttgacagt cgcataggcc ctccttcttc tccttctgca actgacaaag aagagaatcc 780 
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tgctgtgctg gctgaaaact gtttcagaga 
gaataatgct gttagaccag tttttgcgca 
tgaatttgca gttcactgct ttaaaattat 
ccatgtgatc caggagattc taggacacct 
tcgagcaggt attattcagg ttctgttaga 
aggtccgaca gtgctggaag tcttcaatac 
attcgaaagc aaatgattac agggggggat 
tccaaagaca atgatgagaa gattgtggca 
tggaagtaac ctaccagatt atcagaggtc 
acctgtcttt ggaacatcta cccatacttt 
caggagaatt cagataatgt tgctgagatc 
gaagacgatt gttactgcac tgccagggtc 
tcatggagga ctacgaactg agacagttgg 
cgtcatgaca atagggaaaa gcttcgaggg 
aagataaaaa gagaaaaaat ttgcagacaa 
cagctgtatc gggcacatat atttgggttg 
tgaactactt tatacttctc ttgctcttat 
ttattgatct cattcgacct ggccattgct 
taatttgcca atgttccatc gttgtggaat 
tgtaagtcag atgatagctg tccctgcatt 
tcgaactatg gaagcccctt attttctacc 
tccaaaatct ttagaggaag catgaaaaag 
agtcgctagg tggaagtgga tatagtgttg 
taacagatga agatcgactt tctagaagaa 
tggatatttt atccaacaat gttccttctg 
cttttgaagc attggagaaa gccattggat 
aagaggcgtc ttgtgataga gaaatttcag 
tgtgaatcca aagcaaattt gcttcatgat 
acgtcctcct cccagtccat caggaacact 
atctgtccca gtctatgaga tgaagtttcc 
agacctcagg atatgatttg taaagcctaa 
tacttaatgt gtattaacat acttcttgaa 
gtttggcata ccatattaga agttttggag 
agatttatgc catgttaatt tgctttgagg 
ttggtttcac tttattccac tgttaagtag 
ttttttaaaa agtaagcctt cagaggattg 
tttaaacagg cggtttagat gtagaatttt 
ttaaatatat tcctagtcct gggggctgca 
gcgttagata agctggcgat aggcaccttg 
gcattcctca tcttcttcac aataatagga 
ccctaaaaac gccatacttt aaattgtctg 
agtttggacc attctgcttg agaagtgtag 
atatttttta catcataaga tgtatacagt 
tattttgctg tggctccatt aattaaatga 
ttacaacagc tattcaagtt cacctgtgaa 
gcacatttat ttacatttgt atattatttg 
gggactttta atttgtaagg atttgaactg 
tatatttgaa aagctgcctg tgttttaaca 
gctggctttt gatttcagct accaagatca 
gacccataac attacataca tcttaggtgg 
tttctttgta actcataaaa ttctcttagg 
ctatcttttc agattccatt tctttttaca 
ccttagaatc cagtttaatt tttgggcttt 
atggggatgt ttgtgtttat gggctatttg 
gtaaggacaa cgataatagg acaaggcata 
gcgcaacatc ttggaaaacc agtctagtca 
ctggggctgc ttcttaaata agtagatcca 
agtcgttacc gctggaggac taaagggccc 
ttgatcagga aaaatgcttc accagtccgt 
ttttttcctt ctgtattttt aaagaaggat 
atctaagcag ggggacatgc aaaaacaatc 
attaacactg tgtgcttttg tatggaaaaa 
catatatatt catttgcact tacgtgaaac 
cttaggtgat tttaaaaatg catgtattgc 
tttaatccac acctttatcc ttggtctttg 
ttactgtttt taaattacag taggcttcct 
ttgacaagga ttcccaagaa atggatctta 



actgctgggt cgagcaactt ttgggaatat 840 
tttagatcat cacaaactgt gggatcccaa 900 
aatgtattcc attcaggctc agtattctca 960 
tgatgctcgt aaaaaagatg ctccccgggt 1020 
ggctgttgcc attgctgcta aaggttccat 1080 
ccttttgaaa catctgcgtc tcagcgttga 1140 
ctgtagccag tgtcaattta aattacaagt 1200 
gaatgctatc atccaaacaa taggattttt 1260 
agaaatcatg atgttcatta tggggaaagt 1320 
ggatatcagt caactagggg atttgggaac 1380 
tttgcttatg gtgacctctg gatataaagc 1440 
tttcctggat cctttggtta tcaccatctc 1500 
tcttggaagt aatgacataa tatcatggat 1560 
ataagaataa taacggatgt agctgaccta 1620 
gacacaagtt tcatgaaaaa gaatgggcaa 1680 
taaagaggaa gacaacgttc agaaaaacta 1740 
aactattgaa ctgggctaat gaagaagtag 1800 
tttacaggac agtgcaatta tcaatgagga 1860 
catggcactg gttgcagcat acctcaactt 1920 
ttgccagcat gttagcaagg ttattgaaat 1980 
agagcatatc ttcagagata agtgcatgct 2040 
atttgtactt tctgaccaac aagattgcag 2100 
agagattgtc agttccgtat gtaccacaag 2160 
aaagcattgt ggacaccgta tccattcagg 2220 
atgatgtggt tagtaacact gaagaaatca 2280 
accagtggaa tggaagaaca ggaaaaggaa 2340 
aaagcacctt ttgaagaaat agcagcacag 2400 
agacttgccc caaatattgg aactcaccat 2460 
gaccattact tctgggcatg cccaatacca 2520 
agatctgtgt gtgtactgat cggcgcatga 2580 
aaattaaggc caagctgagc tttcagggtt 2640 
aataatgatg gaacatatct ttaaccaaat 2700 
ctatatataa tttcgagtac tttcaaagat 2760 
ttcctgttgc ctttttaagt tgaacatgtt 2820 
tatgttttaa acttttcaca aaatgtaatg 2880 
aaactggatt aattggttat ttcttaaaca 2940 
tgttgttgtt ttctgcaaag gcagatacat 3000 
aaactgttcg gtgggctttt tgtccccagt 3060 
ctattcagtt actataataa tatgtgatag 3120 
catctgttga atagcattcc tcgaatataa 3180 
gttcttgtaa tattgtgttt cctgccacag 3240 
cagcttactc ttggagtacc aaactgtgca 3300 
ttacaggctg tgcttcgaaa ttcatagtag 3360 
gatatcatat ttagtgcaga aaacagacat 3420 
ggagtctgta cattttgatt tctattaaga 3480 
aaatcctcat actcaaaaaa ttctccagat 3540 
tgacttgtat acatctgttt aggaatcaat 3600 
gtcaagtgtg ctaaagtttg tcaatttaag 3660 
caggtgcact ctacacataa cactggacag 3720 
aattctatca catgggtaaa atggaacggc 3780 
acatttttat aaagtcacct gtttatagtt 3840 
taaaacagca tacatatgca aaaactgtaa 3900 
gtttttggtt taaaaaaatt gcagtgaaga 3960 
gggcaccttt tagntaggaa acagacaaaa 4020 
cttgaaaatt tctggaatac ttaaacaaaa 4080 
ttgaaaccta tgaaatgaca ctgaaagatc 4140 
gcaagacttg tttccagagt gaacagtggg 4200 
tgtgggcagc tgtcactgga actttgcctc 4260 
aaagccaagt tgtatttttt ttattgccct 4320 
gttaattttt gacctatata ttttaaaaaa 4380 
atccacccac ttgggatgtc atttttatag 4440 
atatatataa tttaatagta taaaacataa 4500 
acaaactttg ctctacaaaa tttcgtgttt 4560 
atgtaaagga aaacccatta ccaattaatg 4620 
ttggatttgg ggttttgttg ggggtttttg 4680 
atatacccct ggatttctga actggtcttg 4740 
tcaactggct gactctccca tatctgcaag 4800 
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acccattctg caggaactgg gtcgtgcaca 
tttaggattt ggtgaagttc actgtattgc 
taaattgtta atccattctc ttttttgctg 
atgacattgg ttaacattta aatgttcctc 
atagagtttt attgttattg gtgtttaccc 
tttccttaat acttctgaac tcattatctt 
ccagcaatta acttcctccc ctaccccaaa 
ctaatttgtc ttattcgtgc ttatatctgt 
aatgtaaaat ttccactgtg caatttcata 
tataagttca tttttatagc tattgttaca 
attaaaaa 



cggtgttgta gccagttcag gtactgaata 4860 
tatatttttg taagataaat aaaactcaat 4920 
tatagaattg cttatatcac tctttctttc 4980 
ctgtacttgt gttgtcctgt gacccgcctt 5040 
tgaataccct atgccgtaca cacacacata 5100 
ttagaataat aatacctacc tacactttac 5160 
atgtttttcc ttccctgtct gaaaatggaa 5220 
attaaaatgc aataaaggtt agttttttga 5280 
tgtcagcaaa taaaccatag attatttgta 5340 
tttaattaaa tttattcaat atttgcttaa 5400 

5408 



<210> 161 

<211> 2895 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID NO: LI : 347 687 . 113 :2000MAY01 



<400> 161 

agccaacctt acttacagca taagggagcg 
gccttacacc tcgttaggga agagaaacag 
ccttgataac agctaccatg acaaccctgt 
ctagcttaca tgagaacaga ctggcctgag 
ggctgtggac atgaagccct cactggcaca 
aactaccaag actatgagta tctcatcaat 
ggaactgcct ctttcttctt cctttatggg 
accggcgcag tcaggcagat ctttggcgac 
agcgcaacgt ttgtgggcat cacctatgcc 
tgctctgctg tgcctgtgta catttacttc 
ttccccagca agacctctgc cagtataggc 
gttctcccat ggaatgcttt ccctggcaag 
aaaacagctg agttccaaat gaccttccac 
gctacactgg tttccctgct caccttcatg 
aaactcatgg gccgaggcac caagttctga 
gcgaggctct aaccacacag cctacaatgcf 
caccaactgg ccctcttctt acttgatgag 
taaggtctct ctttggactc tcccctctta 
tggttcctgc tagaaatggg aaatgcctaa 
ggaatggagg ctctaattga attttcaagc 
tcaaagggta cttccactga tggaaacaaa 
ggaatgtctt tggtcctctt gccatctata 
aatggaattc attctggtct ctctattacc 
aaaacaataa gagcgtttgc ccaaatctgc 
aaggatcttt cacccacaga aagagagcac 
cctaactcag ccaaccttac ttacagcata 
ggcatctggc cttacacctc gttagggaag 
tcccttctcc ttgataacag ctaccatgac 
gaaggaaact agcttacatg agaacagact 
atggtgtaac ctgagatggc cctctggtag 
gtcttttttc tgttaattag ttgtgtactc 
catttcatgg ggtattatcc attcagtcat 
tagaatgatg cacatttcat gtattccagt 
aaatgtctgg agaataattc tttgattatg 
taagcatcac aaatgatatt aaaaattggc 
attctgcctt tgcagaaaca gatttggtga 
ttctttctag ctaatggaaa atgattttac 
taggtttaac ataaaggtta ttttctcctg 
a t cage tat t cagttggtaa gcttccagga 
gaatagctcc cagatttcag tcttttcctg 
aaagtctgat tggttttaat tgaaggaaag 
gttgtgttgt tcttttcccc caaagtggtt 
tacaaaaaga cacttgatac ttgttttcaa 
ctggaaggag gagagggaaa atgttttgta 
taatcegtae cttgtgtttt gttttgattt 



tagaatctgt gtagacgaag ggggcatctg 60 
ggtgttgtca gcatcttctc actcccttct 120 
ggtttccaag gagctgagaa tagaaggaaa 180 
gagcagcagt tgctggtggc actgttctgt 240 
gaaaagctaa ttgagaccta tttctccaaa 300 
gtgatccatg ccttccagta tgtcatctat 360 
gccctcctgc tggctgaggg cttctacacc 420 
tacaagacca ccatctgcgg caagggectg 480 
ctgaccgttg tgtggctcct ggtgtttgcc 540 
aacacctgga ccacctgcca gtctattgee 600 
agtctctgtg ctgacgccag aatgtatggt 660 
gtttgtggct ccaaccttct gtccatctgc 720 
ctgtttattg ctgcatttgt gggggctgea 780 
attgetgeca cttacaactt tgccgtcctt 840 
tcccccgtag aaatccccct ttctctaata 900 
tgcgtctccc atcttaactc tttgccttgc 960 
tgtaacaaga aaggagagtc ttgcagtgat 1020 
tgtacctctt ttagtcattt gcttcatagc 1080 
gaagatgact tcccaactgc aagtcacaaa 1140 
atctcctgag gatcagaaag taatttcttc 1200 
gtggaaggaa agatgetcag gtacagagaa 1260 
ggggccaaat atattctctt tggtgtacaa 1320 
actgaagata gaagaaaaaa gaatgtcaga 1380 
etattgeage tgggagaagg gggtcaaagc 1440 
tgaccccgat gggcgatgga ctactgaagc 1500 
agggagegta gaatctgtgt agacgaaggg 1560 
agaaacaggg tgttgtcagc atcttctcac 1620 
aaccctgtgg tttccaagga gctgagaata 1680 
ggcctgagga gcagcagttg ctggtggcta 1740 
acacaggata gataactctt tggatagcat 1800 
tggcctctgt catatcttca caatggtgct 1860 
cgtaggtgat ttgaaggtct tgatttgttt 1920 
ttgtttatta cttatttggg gttgcatcag 1980 
actgtttttt aaactaggaa aattggacat 2040 
tagttgaatc tattgggatt ttctacaagt 2100 
atttgaatct caatttgagt aatctgatcg 2160 
ttagcaatgt tatcttggtg tgttaagagt 2220 
atatagatca cataacagaa tgcaccagtc 2280 
aaaaggacag gcagaaagag tttgagacct 2340 
tttttgttaa ctttgggtta aaaaaaaaaa 2400 
atttgtacta cagttctttt gttgtaaaga 2460 
tcagcaatat ttaaggagat gtaagagctt 2520 
accagtatac aagataagct tetatgeage 2580 
agaaaccaat caagataaag gacagtgaag 2640 
aataacataa caaataacca acccttccct 2700 
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gaaaacctca catgcataca tacacatata tacacacaca aagagagtta atcaactgaa 2760 

agtgtttcct tcatttctga tatagaattg caattttaac acacataaag gataaacttt 2820 

tagaaactta tcttacaaag tgtattttat aaaattaaag aaaataaaat taagaatgtt 2880 

ctcaatcaaa aaaaa 2895 

<210> 162 
<211> 561 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1146510 . 1 :2000MAY01 
<400> 162 

tgcataaaaa gcagagcgta gcaagtaagc tgcaaatctg aagaaaatcc atgtgtccac 60 

cagctgccat ctccagaact cttactccag gaaatccgaa agagtgaaca ttcttttagt 120 

ctcctactcc tcaattaagt aaatgagaat gagtcagcca acaaagttca tgacaacaag 180 

gtgcaggatg gtgctggcaa agagaaaatc agcaaaggct cgctctgggg agatgccttg 240 

gaaatccgct ttgttctctg ggttgactct gtaattacat cacggccata ccgctaggat 300 

gaactcccca cacaagagat gaagcccgag agaaaagtag ttgaacgagc ggaatggtcc 360 

ccagcgacgg agacagtgaa ccgaactgca gcgccccagg tcagccagta tatacagcag 420 

gtacgcgtcc agcaacttca gacgctgcgg agtggagctc aagtactctt ctaagaaccg 480 

cgaaatgaca gacactaccg acgccgtaca taactgcaac gcaaggtact ccaa;tccgcg 540 

ccccaaactc ttggaggacc c 561 

<210> 163 
<211> 630 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:451710 . 1 :2000FEB18 
<400> 163 

gttcgtccgc tagggtttta gcccggtacc ccgaatcccg ctcctccgcc gcctcgcgtc 60 

cacccttcct cttggcgcgc tccgatccaa gagatggcgg tgccgctgct gacaaagaag 120 

atcgtgaaga agcgggtcaa gcagttcaag aggccccatc tcgaccgcta taagtgcctt 180 

aagccaagct ggcgcaggcc aaagggtatt gactcccgtg tcaggaggaa gttcaaggga 240 

tgcaccttga tgcccaacat tggttacggc tctgacaaat caaccaggca ctacctcccc 300 

aataagttca agaagtttgt ggtgcacaat gtctctgagc tggagttgct catgatgcac 360 

aacagaacct actgtgccga gatcgcccac aacgtttcaa ccaagaagcg caaggagatt 420 

gttgaacgtg ccgcgcagct cgacattgtg gtcaccaaca agctcgccag gctccgcagc 480 

caggaggatg agtagtctgg tttcttgtca taacggccaa ctagatcctg ccattcccgg 540 

ctgtaggaaa ctactctgct ttgtgccttg ttggtactta gaataccagt cattttctac 600 

atttttggta ccagtagcga cttctatgtg 630 

<210> 164 
<211> 570 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 455771. 1:2000FEB18 
<400> 164 

ccgcttcttg cagccgtcgc cgcgaagcgc tccagagaac ttccgtcaac atggggaaga 60 
cacgtggtat gggagccggg cgcaagctca agacccaccg caggaaccag cggtgggctg 120 
acaaggcata caagaagagc catttgggca atgagtggaa gaaacccttc gctgggtcat 180 
cccatgccaa gggcattgtc ctggagaaga ttggtattga ggccaagcag cccaactccg 240 
ctatccgtaa gtgtgctcgt gttcagcttg ttaagaatgg caagaagatt gctgccttcg 300 
tgccaaatga cggttgtttg aactacattg aggaaaatga tgaggtcttg attgctggat 360 
ttggtcgtaa ggggcacgct gtgggagata ttcctggtgt ccggttcaag gtcgtcaagg 420 
tttccggtgt gtctctgctt gcccttttca aggagaagaa agagaagcca aggtcttaga 480 
ttgctcttgc taccaaaatc agcaagcgtg gagttgaaac gggagggcgt tagatgatta 540 
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agaagaatgg ttgcattgct atgtttgcag 570 

<210> 165 
<211> 821 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 452089 . 1 : 2000FEB18 
<400> 165 

ttacacgtcg tgactgggaa aacaccgtcg tccgccgccg ccgaaggacg gaaggagaag 60 
agggtacggc cgtctcctcg cccccatggc ccacgagaag aagctgtcca acccgatgcg 120 
ggagatcaag gtgcagaagc tcgtcctcaa tatctccgtc ggggagagcg gcgaccgtct 180 
cacccgcgcc gcaaaggtgc tcgagcagct cagcggccag acccccgtct tctccaaggc 240 
gaggtacacg gtgcggtcgt tcggcatccg gcgtaacgag aagatcgcct gctacgtcac 300 
ggtgaggggc gagaaggcca tgcagctgct tgagagcggc ctcaaggtca aggagtacga 360 
gctgctcagg aggaacttca gcgacaccgg gtgctttggt ttcggcatcc aggagcacat 420 
cgaccttggt atcaagtacg atccttcaac aggcatctac ggaatggact tctacgtcgt 480 
gctggagcgt gcgggctacc gtgtggcacg ccgcaggagg tgcaagtccc gcgtcgggat 540 
tcagcacagg gtgaccaagg aggactccat gaagtggttc caggtcaagt acgaaggcgt 600 
catcctcaaa caaggctcag gcctacacgt cccccccctc acctgtgggc aaaacagctc 660 
tctggtctct tctcctcctc cgtgtcaacg caagaccacc acccacctcg ccaggctttt 720 
ttgggtttaa tttgtgttct cagacctgat gatttagtct gctagcactc tgtgggatgc 780 
tgttgtcgca agatcaccac catgtcaaat tagttccgtg c 821 

<210> 166 
<211> 503 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 246415 . 1 :2000FEB18 
<400> 166 

tgcttcctcc agtccgtgcc tccaatatga tgaaaaaaag aaggaacaac ggtcgtacca 60 
aaaagggccg cggccacgtg cagcctattt gcgacacgaa ctgtgcccaa tgtgtgccca 120 
aggacaaggc tattaacaaa ttcatcattg gaaacacagt ggaggctgca gcagtcaggg 180 
acatttctga agcgagcgtc tttgatgcct atgtgcttcc caagctgtat ttgaagctac 240 
attactgtct gagttgtgca attcacagca gagtagtcag gaatcgatct tgtgaagccc 300 
acaaggactg aacaccccta ctccaattta gacctgcgga tgctgcccca caacccccac 360 
caaagcccac gtaaggagtt gagtccttaa agtctgaaga cggactattc tcattgagaa 420 
aaataaaatg gaaattgtac cttaattaaa aaatacaaaa agtagccatg agtggtgaca 480 
tgcaccatgt ggtcccagct aca 503 

<210> 167 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID, NO: LG:414144 . 10 :2000PEB18 
<400> 167 

ctgcaccact agaaagatgg cggagcaaga gcaaagaaaa atccctttgg ttccagaaaa 60 
tctcctgaaa aagaggaagg cttatcaagc cctcaaagcc acccaggcaa agcaggcact 120 
tttggcaaag aaggagcaga agaaaggaaa agggctcagg tttaagcgac tggaatcatt 180 
cctacatgat tcctggcggc agaaacgtga caaggtgcgt ctcagacgac tagaagtgaa 240 
acctcatgcc ttggaattgc cagataaaca ttccttgggc ctttgttgta cgcatcgaaa 300 
ggattgacgg cgtgagttta ctggtgcaga gaactcattt tgaaacgtgg acaagccaag 360 
gtcaagaata agaccatccc tctgacagac aatacagtga ttgaggagca cctggggaag 420 
tttggcgtca tttgcttgga agacctcatt catgaaattg ccttcccagg gaagcatttc 480 
caggagatct catggttctt gtgccctttc cacctctcag tggcccgtca tgctaccaaa 540 
aatagagtgg gcttcctcaa ggagatgggc acacctggct atcggggtga acgcatcaat 600 
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cagctcatcc gtcagctgaa ctagacccag gtgccaaact gcggtaaatt ttttatcagt 660 
gaagtggaag catgtgtttt gttttggaaa tttttatcaa gtatcttcag agaagattat 720 
ttcctgcttt atcttcagaa actggaaagg gtc 753 

<210> 168 
<211> 885 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1101445 . 1 .-2000FEB18 
<220> 

<221> unsure 

<222> 754, 809, 855, 866 
<223> a, t, c, g, or other 



<400> 168 

ctggaaccat ggaggctgta 
ttaagaagaa aaaggttcct 
cagagttgaa ggtcaagcgc 
ggaggaagct catctatgag 
ggactgagat tcgcatggct 
aaccaaaatt ggcctttgtc 
gcaaggtgct gcagctgctc 
acaaggcttc agtgaacatg 
aacctgaagt cagtaaacga 
cgcattgcct tgacagataa 
tgcatggagg atctaattca 
aacttcctgt ggcccttcaa 
cactttgtag aaggt'ggaga 
ggaaggaatg aacttaaggt 
ataaacagta cctgnctctc 

<210> 169 
<211> 586 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LG: 452134. 1 :2000FEB18 

<400> 169 

tacacccgcc gagatattga tgcctaagaa gaactgggat tcccatttat gaactccttt 60 
ttaaggagag agtcatggtg gccaagaagg atgtccacat gcctaagcac ctggagctgg 120 
cagacaagaa tgtgcccaac catcatgtca tgaaggccat gcagcctctc aagtcccgag 180 
gctaagtgaa ggtacagttt gcctggagac atctctactg ctaccttacc aatgagggca 240 
tccagtatct ccgtgattac cttcatatgc cccggagatt gtgcctgcca ccctacgccg 300 
cagccattct gagactggca ggcctcagcc taaaggtctg aagggtgagc gacctgcaag 360 
actcacaaga ggggaagccg acagagatac ctacaggcag attgctgtgc cccctgatgc 420 
cgacaggaag gctgaggctg aggctggggc tgggtcagag acggaattcc agtttagagg 480 
cagatttggt tgtggaggtg gtcagccacc tcagtaaagt tggagaggat tattttgcat 540 
taaataaact tacagccaaa aaaaatttaa aaaaaaatct cattaa 586 

<210> 170 
<211> 659 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 903021. 1:2000FEB01 



ccagagaaga aaaagaaggt 
gcggtgccag aaacccttaa 
ctgaggaaga agtttgccct 
aaggcaaagc actatcacaa 
aggatggcga ggaaagctgg 
atcagaatcc gaggtatcaa 
cgtctccggc agatcttcaa 
ctgaggatcg tggagcccta 
gctcatctac aaacgaggct 
ctccttggtt gctcgatctc 
tgagatctat acagttggaa 
actgtcttcc ccacgaggtg 
tgctggcaac aggngaggac 
ggtctaccna tggattattt 
aaaatngaaa aaaaaaagaa 



tgccgctgcg ccaggaaccc 60 
gaaaaagcga aggaatttcg 120 
gaagacactg cgaaaggcaa 180 
ggagtacaga cagatgtacc 240 
caacttctat gtgcccgcag 300 
tggagtgagc ccaaaggtgc 3 60 
tggcaccttt gtgaagctca 420 
cattgcatgg ggggtacccc 480 
atggcaaaat caataaaaag 540 
ttggtaaatt tggcatcatc 600 
aacgctttaa ggaagcaaat 660 
gaatgaagaa aaagacaact 720 
cagatcaaac aggcttatta 780 
ttctaagctg ggttgggtta 840 
taaaa 885 



<400> 170 

ctcagttgtg agggggaagg tagactcttt ccagaccttt gataaagggg ctgctcaaaa 60 
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aagccacgac cttgggcctc tgctaaaace atggcfcttaa, ,gtaaggaggg agcagatata 120 
taaaaaacct tatccctctt tctaatctct tatgtcctgt 'gagtgtcacc cattgagtaa 180 
acccaagtgg aagtcagcag gaaagtagct gggatgatgc agtctttaca gggtgttgaa 240 
gagaagaaga aggttcctgc tgtgccagaa acccttaaga aaaagtgaag gaatttcaca 300 
gagctgaaga tcaagcgcct gagaaataag tttgcccaaa agatgcttct aaaggcaagg 360 
aggaagctta tctatgaaaa agcaaagcac tatcacaagg aatatatgca gatgtacaga 420 
actgaaattc aaatatcga,g gatagcaaga aaagctggca acttctatgt atctgcagaa 480 
cccaaattgg cgtttgtcat caggatcgga ggttatcaat tgggtgagcc caaaggtctg 540 
aaaggtgttg caacttcttt gccttcatca aatcttcaat agaaactttg tgaagctcag 600 
cagggcttca ataatgtgct gaggtttgta gaaccatata ttgcatgggc atacccaaa 659 

<210> 171 
<211> 443 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 24 6422 . 1 :2000FEB01 
<400> 171 

tgttgccttc tttctgcctc cgctaccgcc atggcgccca tgaaaaagct tgtggtaaag 60 
gagggctaaa aaaaaaggaa gcaggttcca aagttcactc ttgatcgcac ccaccccgta 120 
gaagatggaa tcatggatgc tgccaacttt gagcagtttt tccaagaaag gatcaaaatg 180 
aacggaaaag ctgggaactt tggtggaggg gtagtgaccc atcgaaggga gcaagagcaa 240 
gaccagcgtg acatccaagc tgcccttttc caacaggtat ttgaaatatc tcaccaaaaa 300 
atatctgaag aagaataatc tacatgattg gttgcgcgta gttgctaaca gcaaacagag 360 
ttacgaatta cgttacttcc aaattaacca ggacgaagag gaggaaaacg aggattaaat 420 
ttcatttatc ggccgggcgc ggt 443 

<210> 172 
<211> 586 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 449404 . 1 :2000MAY1 9 
<400> 172 

atccgtcgcc ccacggaagt ctcggcggcg ccggcgtgct tgtgatctca acccaagtcg 60 
tgcctccgct ccggtcaccc gtcgctccac gcaaccatgt cgaggaggaa gaccagggag 120 
cccaaggagg agaacgtcac ccttggaccc actgtccgtg aaggagagta tgtctttggt 180 
gttgctcaca tctttgcatc cttcaatgac accttcattc atatcactga tttgtctggg 240 
agggaaactc tggttcggat caccggtggc atgaaggtga aggctgaccg tgacgagtcg 300 
tcaccttacg ctgctatgct tgctgctcaa gatgtcgcac agcgctgcaa ggagcttggc 360 
attactgcac tgcacattaa gcttcgtgcc accggaggca acaagaccaa gacccctgga 420 
cctggtgccc agtctgccct cagggcgctt gctcgttccg ggatgaaaat cggacgcatt 480 
gaggacgtta ccccggtccc cacggacagc actcgcagaa agggcggtag gaggggaaag 540 
gaggactgta ggcgtcatca ttactgcgtg ccatttgctg gctctt 586 

<210> 173 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 449413 . 1 :2000MAY19 
<400> 173 

ccgccggcct ccaactctaa ccccgttgct tccggctccc gcagcgtcgc cgaccttcac 60 
agccggttcc ttcctccgcc accatgggga agacacgtgg tatgggagct gggcgcaagc 120 
tcaagaccca ccgcaggaac cagaggtggg ctgacaaagc atacaagaag agccacttgg 180 
gcaatgagtg gaaaaaacct tttgctggat catctcacgc caagggcatc gttctggaga 240 
agattggtat tgaggccaag cagccaaatt cggccatccg taagtgtgcc cgtgttcagc 3 00 
tggtgaagaa tggaaagaag attgctgcct ttgtgccgaa tgatggttgc ctaaactaca 3 60 
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tcgaggagaa tgtatgatga ggtgttgatt gctggatttg gtcgtaaggg tcatgctgtg 420 
ggagacattc ctggtgtcag gttcaaggtt gttaaggtgt ctggtgtgtc gctgcttgca 480 
ctcttcaagg agaagaagga gaagccaagg tcttagatca ctttcggtag tccagaatgg 540 
tgtaaactgc c 551 

<210> 174 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID NO: LG:450105 . 1 :2000MAY19 
<400> 174 

gccgtcggca gctcggccgc cgtactcccg ctaccgagct aggaggcatc accttcgccg 
atccaacatg ggtaagacac gtggtatggg agctgggcgc aagctcaaga cccacagaag 
gaaccagagg tgggctgaca aagcctacaa gaagagccat ctcggcaacg agtggaagaa 
accctttgct gggtcatctc acgcaaaggg aatcgtcctt gagaagatcg gcattgaggc 
taagcagcct aactctgcta tccgtaagtg cgctcgtgtc cagctggtga agaacgggaa 
gaagattgct gcctttgtgc caaacgatgg ttgcttgaac tacatcgagg agaacgatga 
ggtgctgatt gctggattcg gtcgtaaggg ccatgctgtg ggagatattc ccggcgtccg 
tttcaaggtc gtgaaggtct ctggcgtttc cctcctcgct ctcttcaagg agaagaaaga 
gaagccgaga tcttaaaacg ctgcaagggt ttgggcctgg tggcgcaccn cacacctgta 
gcgctcacac aagccatagc agacc 

<210> 175 
<211> 336 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:460809 . 1 :2000MAY19 
<400> 175 

gcgcatgggt ggagtgggct tcaagaagtg cccctcgggc acacagagag atccagaaat 60 
tcgccatgaa ggagatgggg actccaaatt tgcacattga tgtgaggctc aacaaagctc 120 
tctgggccaa aggaataagg aatgtcccat accatatcca tatgaagttg cccagaaaac 180 
ttaatgagga tgaagattca ccagacaagc tctatgcttt ggttcctaca tatacctgtt 240 
accactttca caaatctata gacaggcaat gtggaagaga gctaaccact gatggttcaa 300 
tacattaagt aaaattattt ttaaaaaaga aattta 336 

<210> 176 
<211> 932 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 481781 . 1 :2000MAY19 
<400> 176 

ggcggagcgt aaggcgaagg agtagcagca gcaggcggcg ccgagtagcg gctccccatc 
tcgagcttgc caccatggct agaggattga agaagcattt gaagaggctc aatgccccca 
agcattggat gctggacaag cttggcggag cttttgctcc caagccatct tctggacctc 
acaagtctag ggagtgcctg ccactgatcc tcatcatcag gaacaggctc aagtatgctc 
ttacataccg tgaggtcatt tccatcctga tgcaacgcca tgtacttgtt gatggcaagg 
tcaggacaga caagacctac cctgctgggt tcatggatgt catttccatc cccaagacca 
atgagaacta caggctgctg tacgatacca agggccgctt ccgccttcac ccaatcaggg 
atgaggatgc taagttcaag ctttgcaagg ttaggtctgt ccagtttggg cagaagggca 
tcccttatct gaacacgtat gacggccgca ccatccgcta ccctgacccc ctcatcaagg 
ccaacgacac catcaagatc gatctggaga ccaacaagat tgtggacttc atcaagtttg 
atgttggcaa cgtcgtcatg gtgactggcg gaaggaacac tggccgcgtg ggtgttatca 
agaacaggga gaagcacaag ggcagctttg agaccattca cgtggaggac tcctgggcca 
ccggttcgcc acccgtatgg gcaacgtgtt caccatcggc aagggtaata agccgtgggt 
gagcctgccc aagggcaagg gaatcaagct gagcatcata gaagagcaaa ggaagcggga 
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tgccgctgcc caggctgctg ccaacgcata aatcctctga atgtgctgtg ttgagagttt 900 
tttctagttg ctctgcaaga atatagaaca at 932 

<210> 177 * 
<211> 733 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:1101153 . 1 :2000MAYi9 
<400> 177 

gcgaaggcga ccattccttc ttcgctcctt cgccgccgct atggttaagt actcgcagga 60 
gccgggcaac cctaccaaat cggccaaggc catgggaagg gacctgaggg tccacttcaa 120 
gaacacaagg gagacagctt ttgcgctt eg caagctgcct ttgaccaagg etaagegata 180 
ccttgaggat gttattgetc acaagcaggc aattcccttc eggagatact gtggaggtgt 240 
tggtcgcacc gcacaagcaa agtctcgcca ctccaatggg cagggtcget ggcctgttaa 300 
gtcagecagg ttcatattgg atttgetgaa gaatgetgag agtaacgctg atgtgaaagg 360 
cttggacgtg gacaacctct atgtttcaca catccaggtg aaccaagccc agaagcagag 420 
gcgccgcacc taccgtgctc aeggaegcat taacccttac atgtcctccc cttgccacat 480 
tgagctgatt ctgtcagaga aggaagagee tgtgaagaaa gaggctgaca acattgtege 540 
ggcaaggaag cagtgaagta tgttagcctg ctgataatgg catcctttcg aagcattata 600 
gccgtagctc ttttatgtac cttttcaggc tggtaccgat accgtggatc tggatttget 660 
atgttttgta ctgtgctcga gtgtaaggtc agttcgtaat acaatcaatc tgcgtggaac 720 
aatttgatat cct 733 

<210> 178 
<211> 699 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misG_feature 

<223> Incyte ID No: DI : 257695 . 20 :2000MAY01 
<400> 178 

ggcgccgcgc ggtgaaggtt gtctagtcca egctteggag ccatgccgtc caagggcccg 
ctgcagtctg tgeaggtett eggaegcaag aagacagega cagctgtggg cgcactgcaa 
aegeggcaat ggtctcatca aggtgaaegg gcggcccctg gagatgattg agccgcgcac 
gctacagtac aagctgctgg agecagttet gcttctcggc aaggagegat ttgctggtgt 
agacatccgt gtccgtgtaa agggtggtgg tcacgtgccc cagatttatg gtgagtccca 
ggaactgggc gcatggagga ggtggctctg ggagggaggc cttcacagcg ctcctgtacc 
ctttaattgt gtgtctttct cacagctatc cgtcagtcca tctccaaagc cctggtggcc 
tattaccaga aatgtgagtg agcatgggtc cttcccatga ggtaggtggg tgtgtgggga 
tcaagtcaag gactctgtgt gattatctaa atcctcgtcc ctgctcttct tgccagatgt 
ggatgaggct tccaagaagg agatcaaaga catcctcatc cagtatgacc ggaccctgct 
ggtagctgac cctcgtcgct gcgagtccaa aaagtttgga ggccctggtg cccgcgctcg 
ctaccagaaa tcctaccgat aagcccatcg tgactcaaa 

<210> 179 
<211> 568 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 455771. 1:2000MAY01 
<400> 179 

ccgcttcttg cagccgtcgc cgcgaagcgc tccagagaac ttccgtcaac atggggaaga 
cacgtggtat gggagccggg cgcaagctca agacccaccg caggaaccag cggtgggctg 
acaaggcata caagaagagc catttgggca atgagtggaa gaaacccttc gctgggtcat 
cccatgccaa gggcattgtc ctggagaaga ttggtattga ggccaagcag cccaactccg 
etatcegtaa gtgtgctcgt gttcagcttg ttaagaatgg caagaagatt gctgccttcg 
tgccaaatga cggttgtttg aactacattg aggaaaatga tgaggtcttg attgetggat 
ttggtcgtaa ggggcacget gtgggagata ttcctggtgt ccggttcaag gtegtcaagg 
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tttccggtgt gtctctgctt gcccttttca aggagaagaa agagaagcca aggtcttaga 480 
ttgctcttgc taccaaaatc agcaagcgtg gagttgaaac gggagggcgt tagatgatta 540 
agaagaatgg ttgcttgcta tgtttgca 568 

<210> 180 
<211> 444 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:274551. 1 :2000MAY01 
<400> 180 

gtcacatcct tggcctgaaa ttctttgctg tagcccttca ctgtgacaca actgcaacca 60 
accactttac aaagttttcc ttgtctatca gttttacaaa ggcttagcca tgtctctggt 120 
ttcttaaggt catcaacctt aattgggttg atttggtgtt cggcacaaag ggccacccct 180 
agcttgacat acataggctc ttcacatttg gatgcaagca cgcaaagatg ggcttggccc 240 
ttgtctaagg ctttggcagc tttgcaggtt ccacctgcta ggccatcatg gctgagggcg 300 
gtcttcagtc tcttgtagag cagtattaat atccattaca cctccagcag cgtgtcttcc 360 
ttggcatggt ggtgggtaca ggtgaaggtg aatcttgagc cactcaactt ctgctctgag 420 
catgttgcca tgcagagaaa gaga 444 

<210> 181 
<211> 779 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> raisc__feature 

<223> Incyte ID No: LI : 035973 . 1 :2000MAY01 
<400> 181 

tatagggtaa acatccaatt aaaaggcaga tattggctat ccttggggag gccagcaaat 60 
ttccggataa gggaagctgg tgggcaggaa aaaggcatgc aatggccctc atgagccaga 120 
agaatagctg ctgactgatg cctgggggac caagcccata attgtggagc tggagccgca 180 
ctgtgtccct tgcttcagtc tgatctttgg ggagcctgca cagttctgac ctcagttaag 240 
agctaaggga agagatttgg gaatgtttgg ccctgtgtaa cagcaaaagg attcattggt 300 
gagggtgtct ggcaggtatc cgtaaagata ataagatgaa gggaacatcg ccgtttggaa 360 
agtgtcgtga tatgatacac aagttgtgct gcctctgtgg ctctaaggca taccaccttc 420 
agaagtcaac ctgtggcaaa tgtggctccc ctgccaagcg caagagaaag tgtaactgga 480 
ctgccacggc taaaagaaaa taccacggcg actggttgaa tgaagcacct aaacattgta 540 
tactgcagat ttaggcatgg attctttgca ggaacaacac ctacacccaa gagggcagca 600 
gttgtgtcat ccagttcatc ttaagaattt caatgattag tcacacaata aatattccgg 660 
tttttaaaaa tgtatatatt ttaaacatat atatgtttat atgtatatgt tatatctgta 720 
ttacatatat gtgaaaagag gcagagattg tcagattgga ttaaaaagct gtctgtaag 779 

<210> 182 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 978427 . 5 :2000FEB18 
<220> 

<221> unsure 
<222> 453 

<223> a, t, c, g, or other 
<400> 182 

gtggtgggta tgtgacggat gcctctgttt taggactacg ccggctgtgc tgttctggca 60 
gtggattaac cagtccttca atgccgtcgt caattacacc aacagaagtg gagacgcacc 120 
cctcactgtc aatgagttgg gaacagctta cgtttctgca acaactggtg ccgtagcaac l'80 
agctctagga ctcaatgcat tgaccaagca tgtctcacca ctgataggac gttttgttcc 240 
ctttgctgcc gtagctgctg ctaattgcat taatattcca ttaatgaggc aaagggaact 300 
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caaagttggc attcccgtca cggatgagaa tgggaaccgc ttgggggagt cggcgaacgc 360 

tgcgaaacaa gccatcacgc aagttgtcgt gtccaggatt ctcatggcag cccctggcat 420 

gggcatccct ccattcatta tgaacacttt ggnaaagaaa gcctttttga agaggttccc 480 

catggatgag tgcacccatt caagttgggt tagttggatt ctgt ~ 524 

<210> 183 

<211> 2340 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:247781.2 :2000FEB18 
<400> 183 

ggcaggggcc ccgtgtctta ctagcgatgc cgtatctccc caacagtgct gggtatcatc 60 
acctatgcgg gcatcgacct ggccgtctac gagactctga agaactggtg gcttcagcag 120 
tacagccacg actcggcaga cccaggcatc ctcgtgctcc tggcctgcgg taccatatcc 180 
agcacctgcg gccagatagc cagttacccg ctggccctgg tccggacccg catgcaggca 240 
caagcctcca tcgagggtgg cccccagctg tccatgctgg gtctgctacg tcacatcctg 300 
tcccaggagg gcatgcgggg cctctaccgg gggatcgccc ccaacttcat gaaggttatt 360 
ccagctgtga gcatctccta tgtggtctac gagaacatga agcaggcctt gggggtcacg 420 
tccaggtgag ggacccggag cccgtccccc caatccctca ccccccacac ctcagccact 480 
ggagactgat gatccaacca caggatccct actctttggc cacgagatcc cagtacccag 540 
atcctggatc ctagactcct atgccccaac cattgggtca tgggatccca gcatccagat 600 
cctggatcct agactcctat gccccaacca ctgggtcatg cgatccccac ccttcagcca 660 
ctagatccca gatccccctg taaccataac tgtggatccc ttacttcagc aactcaagtc 720 
tgctacccta accacaagat tcaagattat ccacacccca gcccttaatc cccatccccc 780 
aaatcactgg atcctgcagc cccacatcct aaggtggatc ccacgcttcc ctgtgccccc 840 
tactggatcc tggacctcta cgtcttaacc actggatccc acacaaatca gtgaatggat 900 
cccaacaccc caaccacagg agcacggatt ccctgtacct caacacccag accctgcctc 960 
cctcaggcac cagatccagt gtcctagtga aacgctggat cctagatccc caaccccaga 1020 
tccccatgcc tcgagccctg gatctccaag ctcagctgct ggattctgga tgtcaacaaa 1080 
cctcaccact ggatcctgac aaccacaatg cctggatcct ggggccccca tcactggatc 1140 
ccagatcccc tcactccacc cactggattc ctgcattggt ttttggtttc ctgctccttt 1200 
ttaacctcga cactgggtct cagatccttc tgctgactgc cagatccctg catttcaagc 1260 
actacgcctt ccacccccag gcactggatc ccagattccc aagccttcac ccaccagatt 1320 
ctggctccta aaacaagtgc gggggcccca gtggcacagc aagtggatcc tggcaactgc 1380 
agctgctgga ttccagattc tgggtcccca atccctctgc ccagtccctc aatgttgaaa 1440 
cctcatctct tgaaggcaga tcctgatatt ccaaggcact gaatcccaag ccctgaatcc 1500 
ccggtttctg atctgaatct tccaggcgcc gggtcccaaa tgttcaggcc ccaagtctag 1560 
atcctggcag cccagtcaca gagtatccca cacacactgg tgcccagagc cggcttctca 1620 
tgacatgaaa ttgcatggtc gagggagtct gtggggaagg aagcccaggt cctggctgca 1680 
acctgcacgg atgctggatt ccccctcacc ccacctctgc atggccaccc cctcccagcc 1740 
ctgtggggaa actgttccct ggaaccactc cactccctgc atccccacac ttcacagcat 1800 
cttccatccc cctcccacct tctaggcgaa tagtccccag agctgtgttc ctccaagggg 1860 
tccgaggaat cactcactcc tggaggctgg caaggagaca gtctgaggcc agggacacat 1920 
gaagggatgt ccccacccca gcactatcag ggcctcccca ggcttccaga gttgaaagcc 1980 
aggagaaaat cggcaaagac cacccttccc taaacccaag cacccaatga tgcaaaaaac 2040 
aaaaacaaaa aaaaaccacc aaatccccaa attcattcca gatctatttt tctaccagag 2100 
agaggagcaa agtcctcctc ccctgcgccc ttacattctg cacttcatag ttggattctg 2160 
agcttaggat catctggaga ccccatggag ggacttggaa aggggaactg ggatttgggg 2220 
aggggctgga ggacttccgc acgcttccac ctccttcgac ctccactgcg ccccacctcc 2280 
ctgcctgtgt gtgttatttc aaaggaaaag aacaaaagga ataaattttc taagctcttt 2340 

<210> 184 

<211> 1199 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 034583 . 1 : 2000FEB01 
<400> 184 

gtgcagcatg ccaccgtgat tcccgagacc atggcaggca cccagcagct ggcggactgg 60 
agaaatacgc atgctcacgg aagccattat aatcccatca tgcagcagcc tgcactattg 120 
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accggtcatg tgacccttcc agcagcacag cccttaaatg tgggtgtggc ccacgtgatg 180 
cggcagcagc caaccagcac cacctcctcc cggaagagta agcagcacca gtcatctgtg 240 
agaaatgtct ccacctgtga ggtgtcctcc tctcaggcca tcagctcccc acagcgatcc 300 
aagcgtgtca aggagaacac acctccccgc tgtgccatgg tgcacagtag cccggcctgc 360 
agcacctcgg tcacctgtgg gtggggcgac gtggcctcca gcaccacccg ggaacggcag 420 
cggcagacaa ttgtcattcc cgacactccc agccccacgg tcagcgtcat caccatcagc 480 
agtgacacgg acgaggagga ggaacagaaa cacgccccca ccagcactgt ctccaagcaa 540 
agaaaaaacg tcatcagctg tgtcacagtc cacgactccc cctactccga ctcctccagc 600 
aacaccagcc cctactccgt gcagcagcgt gctgggcaca acaatgccaa tgcctttgac 660 
accaagggga gcctggagaa tcactgcacg gggaaccccc gaaccatcat cgtgccaccc 720 
ctgaaaaccc aggccagcga agtattggtg gagtgtgata gcctggtgcc agtcaacacc 780 
agtcaccact cgtcctccta caagtccaag tcctccagca acgtgacctc caccagcggt 840 
cactcttcag ggagctcatc tggagccatc acctaccggc agcagcggcc gggcccccac 900 
ttccagcagc agcagccact caatctcagc caggctcagc agcacatcac cacggaccgc 960 
actgggagcc accgaaggca gcaggcctac atcactccca ccatggccca ggctccgtac 1020 
tccttcccgc acaacagccc cagccacggc actgtgcacc cgcatctggc tgcagccgct 1080 
gccgctgccc acctccccac ccagccccac ctctacacct acactgcgcc ggcggccctg 1140 
ggctccaccg gcaccgtggc ccacctggtg gcctcgcaag gctctgcgcg ccacaccgt 1199 

<210> 185 

<211> 1012 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 333307 . 2 : 2000FEB01 
<220> 

<221> unsure 
<222> 965 

<223> a, t, c, g, or other 
<400> 185 

aacagatgtg aaaaaagaat tatgacaagg acaacctaat ataagtttta taaattttag 60 
gaaatgtcat ctttttactg gtttcagaga ctttttcaaa tgttttaaac acaattttca 120 
gataacaata atgaaaagct gagccccaaa ccagggacag gtgaaccagt tttaagtttg 180 
cactacagca cagaaggaac aactacaagc acaataaaac tgaactttac agatgaatgg 240 
agcagtatag catcaagttc tagaggaatt gggagccatt gcaaatctga gggtcaggag 300 
gaatctttcg tcccacagag ctcagtgcaa ccaccagaag gagacagtga aacaaaagct 360 
cctgaagaat catcagagga tgtgacaaaa tatcaggaag gagtatctgc agaaaaccca 420 
gttgagaacc atatcaatat aacacaatca gataagttca cagccaagcc attggattcc 480 
aactcaggag aaagaaatga cctcaatctt gatcgctctt gtggggttcc agaagaatct 540 
gcttcatctg aaaaagccaa ggaaccagaa acttcagatc agactagcac tgagagtgct 600 
accaatgaaa ataacaccaa tcctgagcct cagttccaaa cagaagccac tgggccttca 660 
gctcatgaag aaacatccac cagggactct gctcttcagg acacagatga cagtgatgat 720 
gacccagtcc tgatcccagg tgcaaggtat cgagcaggac ctggtgatag atttaatatc 780 
agaggaacaa caataggtga tagaataatg agacgctctg ctgttgcccg tattcaggag 840 
ttcttcagac ggagaaaaga aaggaaagaa atggaagaat tggatacttt gaacattaga 900 
aggccgctag taaaaatggt ttataaaggc catcgcaact ccaggacaat gataaaagaa 960 
gccantttct ggggtgctaa ctttgtaatg agtggttctg actgtggcca ca 1012 

<210> 186 
<211> 318 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 814710 . 2 :2000FEB01 
<400> 186 

cgaaggacct gaaggaaagt ggaaaagtcc aaagtttaag atgccagaga tgcattttaa 60 
gactccaaag atatccatgc cagatattga cctgaatctc acaggtccaa aaataaaagg 120 
agatgtggat gttacaggcc ctaaggtaga gggagatctg aaaggtcctg aagttgacct 180 
caaaggcccc aaagtggaca ttgatgtccc agatgttaat gttcagggtc cagactggca 240 
cctgaagatg cccaagatga aaatgcctaa gttcagtatg cctggcttca aaggagaggg 300 
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cccaggttct agatcgcg 318 

<210> 187 
<211> 677 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 414732 . 1 :2000MAY19 
<220> 

<221> unsure 
<222> 45, 618 

<223> a, t, c, g, or other 
<400> 187 

agaggagcga aaagcatggt ccggggggcg gcggacgcct ggtangcgct cgggccgttg 60 
ttgtggggct gcgcgctggc gctgcagggc gggatgctgt acccaagaga gaggccgtcg 120 
cgggagcgga agaagctgga ccgcctctgg agcttccgcg ccgacttctt cgacaaactg 180 
tgccttggct tctaggagta gtggtatcgg cgtctgctgc gagagtcggg ctccaccatg 240 
gacataccgg ttccctccag cttcaacgac gttggccagg actggcggct gcggcatttt 300 
gtagaccaga tgtggtacga acgggaggtg accttcctgg agcaatggac ccaggacctg 3 60 
cacacaagag tggtactgag gattgtcagt gcccactcct atgccatcgt g.tgggtgaat 420 
ggggtcgacg cgctagagca tgagggatct acctcccctt tgacaccgac atcagtagcc 480 
tgttccaggt ggggcccctg ccctcccgcc tccgcatcac tatcaccatc ggcaacatgc 540 
tcatctcctc caccctgcca ccagggagca tcctcgacat ggccgacacc tccacgtggg 600 
taccatcctg cttccacngc agacacccac cttcctgtcc caccccgtgg ggcattacat 660 
tagggtaatt acattag 677 

<210> 188 

<211> 1295 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : LG:413910. 6 :2000MAY19 
<220> 

<221> unsure 
<222> 1275 

<223> a, t, c, g, or other 
<400> 188 

gatttggctc ccagctttgt agttcggaag aagttgggtc tatagatttc cccctaactc 60 

tccattgatg tgttgagctt cagagggaat aataactcta cgtaaagcat gctggcgtct 120 

caaggagttc tcctgcatcc ttatggcgtg cctatgattg taccggcagc tccttacctt 180 

cctggactga ttcagggtaa tcaggaagca gccgctgccc ctgacacaat ggctcagcct 240 

tacgcttcgg cccagtttgc tcccccgcag aacggtatcc ccgcggaata cacggcccct 300 

catccccacc ccgcgccaga gtacacaggc cagaccacgg ttcccgagca cacattaaac 360 

ctgtaccctc ccgcccagac gcactccgag cagagcccgg cggtcttttt atttgtcatt 420 

acacgtgctg tggcattatt tacctcgatt ttacggccct ctaccacggt gccgtgtaat 480 

ttctccctcg ccctgtccgc atcggcacta ttttcgaaag ttacgaaacc aaatcccttt 540 

gagcctcgct cattaaaaat aatttcaaca tctaagattt taccaaattg accaaacatt 600 

tgtctgaggt ccggatcccg gaacctgaag gggatattgg agacatgcag ccgcttgggc 660 

tgagacttgt tttccgtgtt ttcagaaggt tgtgtctggg gctggccatc cgtcggtgct 720 

gcgtcatctg tctgtgtggc ggtgccagag acggtctgag cgctcgtgtc cgccgggctc 780 

tgctcggagt gcgtctgggc gggagggtac aggtttaatg tgtgctcggg aaccgtggtc 840 

tggcctgtgt actctggcgc ggggtgggga tgaggggccg tgtattccgc ggggataccg 900 

ttctgcgggg gagcaaactg ggccgaagcg taaggctgag ccattgtgtc aggggcagcg 960 

gctgcttcct gattacccct tagctgctct ctttcacaat tcatctgttg tcttgaaacc 1020 

tttctgtctt ctatttccac gcaggtattg cagtctgctt tgatatcctg aagggcatct 1080 

cccacgactg caattagctg ccaagatttc attgtcggct gcaaagtgcc tcctgtttcc 1140 

tcaccgggaa aacacagcac gggagaagtc cgaggcaaca gcaaacggca cccacgtgct 1200 

agcaatgtaa atgactccag tcatctgtcg ttgcgcgccg cgcgccttcg ctcgcccggg 1260 

ctgcctggca cgacngccag caggtgcaga acgcc 1295 
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<210> 189 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: LI : 414732 . 2 :2000MAY01 
<400> 189 

gagagaggag cgaaaagcat ggtccggggg gcggcggacg cctggtaggc gctcgggccg 60 
ttgttgtggg gctgcgcgct ggcgctgcag ggcgggatgc tgtacccaag agagaggccg 120 
tcgcgggagc ggaagaagct ggaccgcotc tggagcttcc gcgccgactt cttcgacaaa 180 
ctgtgccttg gcttctagga gtagtggtat cggcgtctgc tgcgagagtc gggctccacc 240 
atggacatac cggttccctc cagcttcaac gacgttggcc aggactggcg gctgcggcat 300 
tttgtagacc agatgtggta cgaacgggag gtgaccttcc tggagcaatg gacccaggac 360 
ctgcacacaa gagtggtact gaggattgtc agtgcccact cctatgccat cgtgtgggtg 420 
aatggggtcg acgcgctaga gcatgaggga tctacctccc ctttgacacc gacatcagta 480 
gcctgttcca ggtggggccc ctgccctccc gcctccgcat cactatcacc atcggcaaca 540 
tgctcatctc ctccaccctg ccaccaggga gcatcctcga catggccgac acctccacgt 600 
gggtaccatc ctgcttccac cgcagacacc caccttcctg tcccaccccg tggggcatta 660 
cattagggta attacattag gagaacttgt tgtttaaaag agtgtggcac cttcatccac 720 
ctttttcctt tcttgccatg tgacacacca g 751 

<210> 190 

<211> 4466 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :900264.2:2000MAY01 
<400> 190 

tgaaagtact ttgactccca agggtatctg atctgatgga gagaatgtat cacttgtacc 60 
gatgcggata gaacgctcat taatfcacagc acagttttca taaattccaa cttccctagg 120 
aattttgtct tttgaaggca tgttggaagt tggggcatta ggatataatc tcagtccatg 180 
tactaagcct ctgcataact gaaagataga gtcaataaaa ttcactctct cattttttgt 240 
ttttaatctc aagaattttt gccttgaagg acgaatgtag tggcgaattt gatgtttgaa 300 
cgaatcccag tcagaagttc cagcctgcca ctgttctctg atgccatgct atgaccaaca 360 
tcaagctgta ttgtcctctc atccgtaact gtgaactgag caggatctat ggcactgcat 420 
gttactgtca ccagaaacat ctctgtttgt tcctcatcgt acattcctca gagtcgactg 480 
agatacacac ctcatccagc atatgctacc ttttgcaggc caaaggagaa ctggtggcag 540 
tacacccaag gaaggagata tgcttccaca ccacagaaat tttacctcac acctccacaa 600 
gtcaatagca tccttaaagc taatgaatac agtttcaaag tgccagaatt tgacggcaaa 660 
tagtgtcagt tctatcctgt cgatttgaca gcaatcagct gcctgcaaat gcacccattg 720 
aggaccggag aagtgcagca acctgcttgc agaccagagg gctccttttg ggggtttttg 780 
atggccatgc aggttgtgct tgttcccagg cagtcagtga aagactcttt tattatattg 840 
ctgtctcttt gttaccccat gagactttgc tagagattga aaatgcagtg gagagcggcc 900 
gggcactgct acccattctc cagtggcaca agcaccccaa tgattacttt agtaaggagg 960 
catccaaatt gtactttaac agcttgagga cttactggca aggagcttat agacctcaac 1020 
actgggtgag tcgactgata ttgatgttaa ggaggctcta attaatgcct tcaagaggct 1080 
tgataatgac atctccttgg aggcgcaagt tgggtgatcc taattctttt ctcaactacc 1140 
tggtgcttcg agtggcattt tctggagcca ctggcttgtg tggcccatgt ggatggtgtt 1200 
gaccttcatg tgggccaata ctggcgatat cagagccatg ctgggtgtgc aggaagagga 1260 
cggctcatgg tcagcagtca cgctgtctaa tgaccacaat gctcagaatg aaagagaact 1320 
agaacggctg aaattggaac aatccaaaga gtgaggccaa gagtgtcgtg aaacaggatc 1380 
ggctgcttgg ctttgctgat gccatttagg gcatttggag atgtcacgtt ccaatgggag 1440 
cattgagctt tcaagaagag agtgatagaa tctggcccag aaccagttga atggacatga 1500 
attttccaag tttattcctc ctaattatca cacacctcct ttatctcact gctgagccag 1560 
aggtaactta ccagcgagtt aaggccacag ggataagttt ctggtgttgg ctactgatgg 1620 
gttgtgggag actatgcagt aggcaggatg tggttaggat tgtgggtgag ttcccttaac 1680 
tggcatgcat caccaacagc caatagctgt tggtggctac aaggtgactc tgggacagat 1740 
gcatggcctt ttaacagaaa ggagaaccaa aatgtcctcg gtatttgagg atcagaacgc 1800 
atgcaaccgc gatcgtcatt cggccaccgc tgtgggcaac aacgagtttg ggactgttga 1860 
tcattgagcc gcttctctaa aatgcttagt cttcctgaag agcttgctcg aatgtacaga 1920 
gatgacatta caatcattgt agttcagttc aattctccat gttgtagggg cgtttcaaaa 1980 



92/228 



WO 01/62927 



PCT/US01/06059 



cccagaatag tgagtggctc tttcactggc aattctcaaa tgatacacat ttaaagggca 2040 
gattttctta aaaagatact actataataa acatttccag ttggtcattc taagcattta 2100 
cccctttgga tactctagct agtcaggtac tccaaattga cctttgcagc agggtggcag 2160 
ggtcaggaga gtctggtcct gcctagctca gatttcagtg gcagcctgca cttgaagcaa 2220 
cgtcacttct ttatcacagg tcgtcttgag acattcagct tcttttacca aacctgagaa 2280 
aattaggatg cacctggcca aataagatct tgaataggcc aaaagcaagt atcttgctgt 2340 
gtgtagtctc ttggttcaag tgaagcaaca gtactgttca cacctttctt cactgagatt 2400 
cccagtgtac atgagaacat atatttattg catgattttc ctagatacac agtcctatgc 2460 
attattcata tacatttatt ttagcctaaa agtgggtttc aaatcccagt tccttcaagc 2520 
cataaatgac caaggtccaa ggcaatctgg aatttgtttt tgtgattatt tgactggaat 2580 
gcttcttaag tggaataacc tatactccgt tatccacccg atttcctaat gtaattgaaa 2640 
gattttccta ttttgccaca cacttggaga acaataatgg gtttttagtt ttatctactc 2700 
ttctattgaa gttaaagaaa gaaaaaaaga tttttttatt tggtattaat gaaaagcttt 2760 
agtttaaaat aaggagatcc agaataaaaa gaagagactg atctcttcaa ttattgtcat 2820 
ctgtagccac cagcacatca ctcttatgta atccccaaag gcttggcatg ccgtaagtgt 2880 
gtggtgggta gactgctgcc ggggaatcgt acttcttatt tagtaatgat aagacttttc 2940 
attaattttg ccatttttaa agatgacata aataagttta aatatcaatt tggggagtaa 3000 
ggtttaatat tgccatcggg tattgagaca ggaggaagtt tctgtttttc tccatttaga 3060 
ccataggtca attaaaatat ttgggtttaa aaattgacct aaatgcttta aacatattgt 3120 
agcttaagat atatgtgtta agatatatac atgagaaact ttaaaaggta actactgtgc 3180 
atgcctgatg cttaatagaa tacttagtgg catcaaatgt ttgcagcagt ctccatgagt 3240 
tatgttcggt cccttctaat actgtatcaa tgtaaatgaa ataaagtata ttcaaattgg 3300 
ctttttgata tgcatcaagt ggcattttgt ctcctgtgtt taatagtgat ctgtatacag 3360 
gctgtgcaca tattgtcatc acttattcta gcatcactgt taaaggctgt gattatgttt 3420 
gatattcacc tgcgatttta atacaagcca atatcagctt cccattgtgt aataacttgg 3480 
gtgtttagga gtcttttcac attttttggc gatatgaact agatgttcca agaaactcct 3540 
tcctggactg tggatactga atcagtgtac tattggctgc agaatttgtt tcaattgaaa 3600 
atagactcag gaagattgct gctcagaata tcatatagat gtctattttt tgaggtgttt 3660 
ttgtttttat ttgtgtgttt ttcttttttt taagtcagct tggaactttt ttcctgggta 3720 
gtatttggga gagggaaagg ctgtcactat atatttattt ctctaatgtt ttggctcggg 3780 
cctttttctc tttaatgaga atatagtgtg gctggtctca tgacaaaacc catattacag 3840 
tctactatga gaatattctt gaacaccacc actgaagagt ttcatataca ccaaataatg 3900 
tctcatctct atagtacagg gaatataaaa ttggtttcct gtggtcatga tccagatagt 3960 
aggtattatt accacaagaa cttgggtctg ccagtcctgg aaagcttgtc tgctctatag 4020 
aaatgaaaat gcagcatgaa gttgacattg tggaaatgaa aagtaattgg gtattagaaa 4080 
tctgaaagta ctgtccatct aaaagcaatt gtggatttta ttgtaattgg ttgtcccctt 4140 
ttgttccggt ttcttagaat taaaagaata catgtaaacc tttccatggt atttaagccc 4200 
tttccttaaa tttttttaaa atttaaactt tcctaaccct atgtattcaa cttctgtatt 4260 
tatatttaat ccagtggttc atgttatata atacaccctt aactagttaa atggaatgtt 4320 
ggtatggtac agagtaccat attgctaaga aaactgtctt ataaaagatg tatatggtgt 4380 
gaagacatga aagtttaatg taacagaatg gttggagaaa tgcctatggt gaattaaagc 4440 
ttcatatctg ctttctgaaa aaaaaa 4466 

<210> 191 

<211> 1798 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 33 5593 . 1 : 2000MAY01 
<220> 

<221> unsure 

<222> 111, 145, 147 

<223> a, t, c, g, or other 

<400> 191 

aaccagcagc gggaggagcg caacgctagc gcggcgcggg gaccgccgcg cttccttccc 60 

ggctcctgcc ctcctctgcc gctcgctcct ctctcggcat ccacctcctt nccctcctcc 120 

tgctcattca ctctttccct cactntnccc tcctcctcgg cttctccccg cgccccgcca 180 

gccctcgcgc tctcccactc cctctgcctg tcctccccgg tcccctcctc ccgctcattc 240 

actcgcccct ctcccttctc cactctctcc ccctgtctcc tttcttcttc ttctttcacc 300 

ctccgtctct cacaccccct ccattcccct gtctcctttc tgacactgca ctgcagctgc 360 

tcctcagccc tgccccctcc ccagtgagaa caaaccagca acattgcttt ttttcctaaa 420 

gagatttata ttgatccgat taaaaaaaaa aaaccttaag aaacccccga cgcccgaaag 480 

aaaaagaaaa aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct 540 
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cccggtagca gcggcaggaa ctgcaaacat gatggcggca gctcccatcc agcagaacgg 600 
gacccacact ggggttccct atagacctgg acccgccgga ctcgcggaaa aggccgctgg 660 
aagccccccc tgaagccggc agcaccaaga ggaccaatac gggcgaagac ggccagtatt 720 
ttctaaaggg ttctcatacc tagttatgct gctggatcta taattgggga agggaggaca 780 
gacaattgtt cagttgcaaa aagaaactgg agccaccatc aagctgtcta agtccaaaga 840 
tttttaccca gg tact age t geagegagtg tgctgtgagt ccagggaact gttgaagtgc 900 
tgaatgcatg gttcatggca ettcattgea tcagacacat agttcgagaa tatgecctat 960 
aaatatggcc aagacagaac cagtcagcat tctacaaccc cagaccaccg ttaatccaga 1020 
tegcatcaaa caaacatttg ccatcttccc caatctacca ccaagtcctc tccatctgat 1080 
cccatgagcc ccctccagag ctaatcaggt acagtatcat cctgtcatct acaccatact 1140 
tttcaccggg tgattgcctg caaaactgga gcaacacaag atcatggtgg gttccaggat 1200 
ttcattcgtt cattcacata tatgtcacca tagtcagatt tccaataacc tatagaaaga 1260 
aatatctcat tattttctca acattttcta tgectttcta attttagata acaaatacta 1320 
tattcagata ttataactac tctaattttt taattaattt ttccctaaaa atttgactac 1380 
ctaatggggc acctgaagcc tatttattga aattggtaat ttttaatctc cccttattgt 1440 
atcttcctag gaatttttca aaagtaaatt aaggatatct atatatgtaa aaaaaaaaat 1500 
gtataaattg agcaaaggta tatctttaaa agtgaatgca attttatttg ctttttattt 1560 
gagacatgee ttatgatgta aacaactttc aactattgat gaaaagcatt tctagcaatc 1620 
taaaagaaac tgaaaattgg tatttgtggt aattttaatg tttaacaact caggagatgg 1680 
tgttttctta gtattttaaa aaattctaag cattttcatt getctgetga caaactttgc 1740 
caagtagaat atattttatt agtttggatt tatggattgg tttgggtact atttgtag 1798 

<210> 192 

<211> 3690 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1189543 . 1 : 2000MAY01 
<220> 

<221> unsure 
<222> 2889 

<223> a, t, c, g, or other 
<400> 192 

gaattccgac aaaacaaaag ggagaacctt ctcccggtag cageggcagg aactgeaaac 60 
atgatggcgg cagctcccat ccagcagaac gggacccaca ctggggttcc catagacctg 120 
gacccgccgg actcgeggaa aaggccgctg gaagcccccc ctgaagcegg cagcaccaag 180 
aggaccaata egggegaaga eggecagtat tttctaaagg ttctcatacc tagttatgct 240 
gctggatcta taattgggaa gggaggacag acaattgttc agttgcaaaa agaaactgga 300 
gccaccatca agctgtctaa gctgtctaag tccaaagatt tttacccagg tactactgag 360 
cgagtgtgct tgatccaggg aacggttgaa gcactgaatg cagttcatgg attcattgea 420 
gaaaaaattc gagaaatgee ccaaaatgtg gecaagacag aaccagtcag cattctacaa 480 
ccccagacca ccgttaatcc agategcate aaacaaacat tgccatcttc cccaactacc 540 
accaagtcct ctccatctga tcccatgacc acctccagag ctaatcaggt aaagattata 600 
gttcccaaca gcacagcagg tctgataata gggaagggag gtgctactgt gaaggctgta 660 
atggagcagt caggggcttg ggtgcagctt tcccagaaac ctgatgggat caacttgeaa 720 
gagagggttg tcactgtgag tggagaacct gaacaaaacc gaaaagctgt tgaacttatc 780 
atccagaaga tacaagagga tccacaaagt ggcagctgtc tcaatatcag ttatgccaat 840 
gtgacaggtc cagtgggcaa attccaatcc aaceggatet ecttatgeaa acactgetga 900 
agtgttacca actgetgeag cagctgeagg gctattagga catgctaacc ttgctggcgt 960 
tgcagccttt ccagcagttt tatctggctt cacaggcaat gacctggtgg ccatcacctc 1020 
tgcacttaat acattageca gctatggata taatctcaac actttaggtt taggtctcag 1080 
tcaagcagca gcaacagggg ctttggctgc agcagctgcc agtgccaacc cagcagcagc 1140 
ageagecaat ttattggcca cctatgccag tgaagectea gccagtggca gcacagctgg 1200 
tggtacggcg gggacatttg cattaggtag cctggctgct getactgetg caaccaatgg 1260 
atattttgga gctgcttctc ccctagctgc cagtgecatt ctaggaacag aaaagtccac 1320 
agatggatcc aaggatgtag ttgaaatagc agtgccagaa aacttagttg gtgeaatact 1380 
tggcaaagga gggaaaacat tagtggaata ccaggagttg actggtgcaa ggatacagat 1440 
ctccaaaaaa ggagaattcg tacctggcac aaggaategg aaggtaacca ttactggaac 1500 
accagctgca acacaggctg ctcaatattt aattacacaa aggatcacat atgagcaagg 1560 
agttcgggct gccaatcctc agaaagtggg ttgagtgccc cagttacaca tcagattgtt 1620 
ttaacccctc ctttacccca ttttcaagaa ggatgtactg tactttgeag aagtgaagtt 1680 
tttctgttat taatatataa ttatgcaaat gaatgegact atgttgacaa tgtgtatatg 1740 
taaataatat gtgttttacc agatgtttca tagaaagaat tttttcttga tctgttttgt 1800 
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tctctatact ttgcttgtgt atatttgtca gaggtgtttc tagtgtaaga tttaagcctg 1860 

ccattttacc agcattattg tagtttaatg attgaatgta gacagggata tgcgtatagt 1920 

tttcagtatt agttctagat aacactaaat taactactgt taggttgagt atggtggggt 1980 

cagtgaccta aaatggagtg aggccaaagc actgtcctgt aagtcttact tcctgcttag 2040 

ggcacagtga cagtcggaaa catatttcga aactaacgtc tcaaatttaa aatgatcaaa 2100 

accgccatat acgttgccct acaacgccct gtcaactatt ctacacggtt aaactgtgga 2160 

ccattatatg ggtaagttta cagatcaata aaagcacctg ttctccatct gaactagaca 2220 

atggaaataa tgctgcatgc tggccatggc ccattcttca tcatttgtaa gttcaacaaa 2280 

agttctcaca tggagtccca cctcttcaga ggtttgtaca tttgttttta agcactgaat 2340 

tcactactga tcccatcgcc tggccagtag aacagtcatt actccattaa catcctcact 2400 

gtttagacac ataactgtgg tacagtgtat tggaaatttt ataaacaaaa gtgaaagtgc 2460 

caacaaatta ttgatagctg ataatgtttc attatctgca actgcttgat aagtatgttg 2520 

cattttaaga gcttataatt gtgtataatt tgttaacact agaaacctat tagtattgtg 2580 

aatgtagatt ttactgtgaa gctatctgtg atttagctgt ttgctcccat gatggagtct 2640 

ttgcagcatg gcgctagcag ccaatgcagt ttctaatact cggtaatttg catgttttgt 2700 

ggagcatttt tatgtcacca accagacagt atttcctgca tgcttattta gaagaggcag 2760 

cttatcttga gaggtagtgt tatctacctt tgtcaggctt tttgacaggt catttcagag 2820 

taagcctttg ttcccaagac ccaacaactg tcaccctctt ctgtacctct cctgagtgcc 2880 

aactgtgcna ggccatttga cacaccatct gttaacctct gagtttgccc actcaaggcc 2940 

actcataggg gcatcctagc cctgtgcact cagcactcat aggatcatcc agactctcat 3000 

gcggcatgca gtctaatcat gacaaataat gctgctactc tgatatctgg ctgagcaact 3060 

gaattacaaa agagaattac ttccatctca acttcaaccc attgattacg tccatcctag 3120 

caagctaaat ggcatcccag ctgctccttt ctgtgcaacc aattaaagaa caatgagtgt 3180' 

gatgctccat gtctgaattt cgtccagcct ctctctgaac tgtgatcttt gtcctcatga 3240 

actttccctt ttgttcattg aactatatgg actctgtcat ttcatattga tttactgtgc 3300 

aatttacttt tggacattga gaacttgaaa ttatttcctg atcccttccc cttccactat 3360 

taataattca tttctgtcaa actgtaagag tagactcatt tttttttttt tagtttttaa 3420 

cattggactg ttatttcatt tagagttctc tatctctaaa tatttattta gagaatgatt 3480 

ttaaaaggga atgatatgcc ttgtttaaat gaaagagaaa agctgtagta aactgtgtta 3540 

attggtaatg actatttatc gtcgatactc tgtagctgtg taagttttga caaatagtgt 3 600 

atctcgtgga atcagtggtt agcattgccg ctattatatt tactcatttt atcattataa 3 660 

atgtgcttag ttcatcatgt aaaaaaaaaa 3 690 

<210> 193 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 455450 . 1 : 2000FEB18 
<400> 193 

ggcggcggca tccttcgccg ctccggcagc agctccagct ccagcagctc gtcatctgaa 60 

gacgacggca tgggtgggcg ccggaagaag ggcctgaagg agaagatcaa ggagaaaatg 120 

ccaggaggcc acagggaagg ccagggccag gcgacggcca ccggtgcgta cggcgggaca 180 

gggtacgtgg ctgggccgac gaccggaggc ccccacgaga agaagggtgt ggtggagaag 240 

atcaaggaga agatcccagg cggccacaag gactacgacc agcatcagca caccaccgcg 300 

gcaaccggtg gcggcggtgg ctatggcgga accaccgaca ccacgtatgg gacgacgacc 360 

actgaaggca ctcacgagaa gaagggcttc atggacaaaa tcaaggagaa actccccggt 420 

cagcactaag cactaagcag ctgcctcttt cctcttccgg ctagctgcct ctttcctctt 480 

ccggttggcg actgcctgta agtacatgta tacagaataa ataaagcaat aaagaataag 540 

cggcagctac gtatgtgtgt gtattcgtgt gtatgcattg taaggcag 588 

<210> 194 
<211> 640 
<212> DNA 

<213> Homo sapiens ' 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1040978 . 1 :2000FEB18 
<400> 194 

ctcctaccta cgcagcagag gccagccacc tccccgtcgc tctcacgctc ttcgtgcccg 60 

ccgcctctcc tccgtctctg cctctctccc tctcccctcc cgtctgacgc acatggcctc 120 

catcgccgga tcttcggccc tctccttcgc caggcccgtc aaggcaatca acacaaattc 180 
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tctagctttc tccccggcga ggaagggcaa cacattcctc cgcctgcaac caatgcctat 240 

gagatctgtt tcctgcgctg ccaagaagga tacaacagac aaggtttgtg agattgtgaa 300 

gaagcagctt gctcttcctg accatactga agtttgcggt gaatcgaaat tctctgaact 360 

cggtgcagat tcactggaca cggttgagat tgtgatgagt ctcgaggagc acttcgatat 420 

tagcgtggag gagtccagcg cacaaacaat cgccacagtt gaagacgccg cggacctcat 480 

cgacaagctt gttgctggca aagcgtagtt ctgtacctcc tctatcgttg tcctaccact 540 

ctaggtctat cccttgatct aggtttctat ggtctgaaga tttgtgattt tctccgcact 600 

gttgttgcta tccccgtgtc gtaataatac gagatgtttt 640 

<210> 195 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 446649 . 1 :2000FEB18 
<400> 195 

gctgaaagcg cgggagcaag ctcagaagag ggaagcaatc caggtcacct cgccagtatg 60 
tctcaggctg attttgagaa aggcgggtga ggaggtgaag cgcctcaaga ctcagccaac 120 
tgatgaagag atgctgttca tctacagcca cttcaaacaa gctactgtgg gcgatgtaaa 180 
cacagatcgg ccggggctgt tggacctcaa gggcaaagcc aagtgggact cgtggaacaa 240 
gctgaaagga acttccaagg aaaatgccat gaagacctat gtggaaaagg tagaagagct 300 
aaagaagaaa tatggaatat taacgaccag atttggtggg cagccacatg tgtgacataa 3 60 
tacctgtgag gacataatgc 380 

<210> 196 

<211> 3854 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 132147 .3 :2000PEB18 
<400> 196 

ccgcgtcctc cgcggctcag gacgactacg ctcggcacgt ccgggaccct ccggccgtgg 60 

cggttgcagc gccaaaagct cgggcctcag cccctgacgc tgtggtgact ccgccgcgcc 120 

tcgccgtcgc ccctgtcccc gtcctgcccg ccgcctccgc cgctggggac atgtccaacc 180 

ccggaagccg gaggaacggg cccgtcaagc tgcacctgac agtactctgt gcaaaaaacc 240 

tggtgaaaaa ggattttttc cgacttcctg atccatttgc taaggtggtg gttgatggat 300 

ctgggcaatg ccattctaca gatactgtga agaatacgct tgatccaaag tggaatcagc 360 

attatgacct gtatattgga aagtctgatt cagttacgat cagtgtatgg aatcacaaga 420 

agatccataa gaaacaaggt gctggatttc tcggttgtgt tcgtcttctt tccaatgcca 480 

tcaaccgcct caaagacact ggttatcaga ggttggattt atgcaaactc gggccaaatg 540 

acaatgatac agttagagga cagatagtag taagtcttca gtccagagac cgaataggca 600 

caggaggaca agttgtggac tgcagtcgtt tatttgataa cgatttacca gacggctggg 660 

aagaaaggag aaccgcctct ggaagaatcc agtatctaaa ccatataaca agaactacgc 720 

aatgggagcg cccaacacga ccggcatccg aatattctag ccctggcaga cctcttagct 780 

gctttgttga tgagaacact ccaattagtg gaacaaatgg tgcaacatgt ggacagtctt 840 

cagatcccag gctggcagag aggagagtca ggtcacaacg acatagaaat tacatgagca 900 

gaacacattt acatactcct ccagacctac cagaaggcta tgaacagagg acaacgcaac 960 

aaggccaggt gtatttctta catacacaga ctggtgtgag cacatggcat gatccaagag 1020 

tgcccaggga tctta'gcaac atcaattgtg aagagcttgg tccattgcct cctggatggg 1080 

agatccgtaa tacggcaaca ggcagagttt atttcgttga ccataacaac agaacaacac 1140 

aatttacaga tcctcggctg tctgctaact tgcatttagt tttaaatcgg cagaaccaat 1200 

tgaaagacca acagcaacag caagtggtat cgttatgtcc tgatgacaca gaatgcctga 1260 

cagtcccaag gtacaagcga gacctggttc agaaactaaa aattttgcgg caagaacttt 1320 

cccaacaaca gcctcaggca ggtcattgcc gcattgaggt ttccagggaa gagatttttg 1380 

aggaatcata tcgacaggtc atgaaaatga gaccaaaaga tctctggaag cgattaatga 1440 

taaaatttcg tggagaagaa ggccttgact atggaggcgt tgccagggaa tggttgtatc 1500 

tcttgtcaca tgaaatgttg aatccatact atggcctctt ccagtattca agagatgata 1560 

tttatacatt gcagatcaat cctgattctg cagttaatcc ggaacattta tcctatttcc 1620 

actttgttgg acgaataatg ggaatggctg tgtttcatgg acattatatt gatggtggtt 1680 

tcacattgcc tttttataag caattgcttg ggaagtcaat taccttggat gacatggagt 1740 

tagtagatcc ggatcttcac aacagtttag tgtggatact tgagaatgat attacaggtg 1800 
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ttttggacca 
ttaaaccaaa 
tctatgtgaa 
gatttaatga 
tcattatttg 
aacactgtac 
atgaagagcg 
agggcttcaa 
atgcctgcac 
cctatgaaag 
gatttgctgt 
actgacagcc 
cccccccccc 
tttgaaattc 
gctgctattt 
agttaggcag 
aatttcttca 
tcacatttca 
aaaaaaataa 
ctttaagaaa 
cattatatgt 
ctggaataca 
t tag teat tt 
ttaaaaaaaa 
tacttttaat 
attttegtta 
agtctctgac 
acaaaaccat 
gaagaatttt 
atgaagtggg 
ctttgaagag 
atctgatttt 
agcaacgaaa 
attcaaccat 
tttcatgata 



taccttctgt 
tggcaaaagt 
ctggagattt 
agtaattcca 
tggacttgga 
accagacagc 
acgagcaaga 
ageattgeaa 
taacaacctg 
ctatgaaaag 
ggaatgacaa 
tcctttcagc 
cttttttttt 
cttttcaagg 
aaacttaata 
tattctacac 
caga£ttcaa 
ttttgattga 
aattattttg 
ttaagecatt 
ttcaacctag 
gcgatataat 
atttgtgaca 
aaaaacctaa 
cttaaaatac 
ttttgctggg 
aagtgcacct 
ectaaagegt 
gctctaaaga 
gttgaggggc 
ttcattttgg 
gaaaegtcca 
actctggggt 
caaaaatatg 
atta 



gttgaacata 
atccctgtta 
ttacgaggca 
caacatctgc 
aagatagatg 
aacattgtca 
ttgcttcagt 
ggtgctgcag 
ccgaaagccc 
ctatatgaaa 
gcttcaagga 
agagtttcaa 
tttttttaca 
aaaaaaaagg 
tcttgaacct 
ttaaagacta 
gtctctcaaa 
atacatgatc 
aattattcta 
tacatgtttg 
tctaagtggg 
tttggttgtc 
ataagatgtg 
tgtttttatc 
tgcattttta 
tcagcttatc 
aatttatatt 
taactgetea 
ttattaataa 
ggtagtatga 
aagtaaggga 
gatttagctt 
gagggattga 
tactatcagt 



atgcatatgg 
atgaagaaaa 
ttgaggctca 
tgaagacatt 
ttaatgactg 
aatggttctg 
ttgtgacagg 
gcccgagact 
acacttgett 
agetgetaac 
tttacccagg 
agaatatget 
ttttaggaca 
tetttatget 
aaagaatget 
ctactatttt 
catcaagaca 
ttgaacagct 
cctttgtaaa 
tgtttttcta 
tcttttttac 
aaattcctaa 
ttcaggggct 
ttgagtcaat 
tagcttc.tca 
tttaatatat 
gtattttaac 
taattttaat 
gcttagaaaa 
acagtgattc 
atcccgcttc 
tcaaggtgcg 
aaaaatctga 
ttttttctac 



tgaaattatt 
taaaaaagaa 
attcttggct 
tgatgagaag 
gaaggtaaac 
gaaagctgtg 
atcctctcga 
ctttaccata 
caatcgaata 
agecattgaa 
actctattta 
gaaatacagg 
ctgtgagggg 
ttgecatgag 
gacttttcct 
tataaaaggt 
acttcagcag 
cctgtacttg 
caattggcta 
tagcagagca 
atttttcaag 
tgcaaccatt 
ccctgttttt 
atgattaggt 
gagcatgtgg 
ggactattcc 
tacagtgtaa 
cagctacagt 
tectgetttt 
taactctatg 
ccttggtgtt 
tgcatgttga 
aatcacagtt 
attgatgtga 



cagcatgaac 
tatgtcaggc 
ctgcagaaag 
gagttagagc 
acccggttaa 
gagttttttg 
gtgcctctgc 
caccagattg 
gacattccac 
gaaacatgtg 
tacaacccca 
aaaacactcc 
aaaggacaat 
gccacattca 
acatttccag 
aatctattca 
teggtacaag 
ctctttgtaa 
aaagaatcat 
ttatattttg 
aaeggattte 
tagtctaaac 
aagagactct 
attttggatt 
atgggatggg 
tataaaccaa 
gtttccatta 
tatgaaaaag 
acctaacaga 
gegtaaatta 
gaagatctgc 
atgatccaga 
ggatttttaa 
aataagcaat 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3854 



<210> 197 
<211> 905 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 03 6034 . 1 : 2000FEB01 



<400> 197 

gggaggatga 

gagccccagc 

cgaagaaatc 

ctctctgcca 

tgaaccattc 

ctacacaaag 

ecatttgeta 

caatatgetg 

cagcttgaag 

gctgaagctc 

tccagtgaca 

gatggaatgg 

gattcttcag 

egggaaagtc 

ttggagaaag 

gaagc 



cctcttaccg 
gacaccc'gag 
cagaggatgt 
ttttcttagt 
atctgaaaac 
aactatggca 
aaggaatagt 
aagacctget 
actttttcaa 
actcagattc 
gtgatctggt 
cttccaatca 
cactaggggc 
gtagectett 
caggagaatt 



ggagcggagt 
gggctacaca 
ccaggaggga 
ccttttttcg 
tcttataatt 
aattcacaaa 
gtttgggcga 
acagttctct 

gggtggaata 

cctcattgat 
atctcttact 
aaattctccc 
tgttgcttct 
tcctggcagt 
aataaagctg 



gccgacctgg 
gtatataagg 
gaatcctgaa 
ggaaaactct 
gtatggaaga 
aacttattcc 
tttgatgaaa 
gecaatatte 
attaatgata 
acctttcctg 
gttgatgtgg 
attagaactt 
gacagtgaac 
ttaaagcega 
gctttaaaaa 



cccgtttcta 
tcaccgcccg 
ccaacctatc 
tttataataa 
gatacagtga 
gacattcaga 
ctgttatcga 
ctgetcttta 
gttctgaatt 
agtgtagtac 
attctcttgc 
ttggtctcaa 
agagcaaaac 
agcttggcaa 
aggaagaaga 



cactgtcacc 60 
ggttgtttca 120 
catgaacata 180 
caactgttgg 240 
ttttaagaaa 300 
gttgtttcct 360 
agagagaaga 420 
caatagtaaa 480 
aattggtcct 540 
ggaaggcttc 600 
tgagttagat 660 
tctttcttcg 720 
agaagaagaa 780 
gagagattat 840 
agacgactat 900 
905 



<210> 198 
<211> 585 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 162161 . 1 : 200 0MAY19 
<400> 198 

gagtcttcgg cgagacctac cgggccctgt ctgtgccggc cgtgagctgc ggcaggacgg 60 
ctggagatta cttctctaac aggatcacca gtttgctcca caaagcacaa tgtctcgatc 120 
acgacaaccc ccccttgtga ccggcatctc tccaaatgaa gggataccat ggacgaaggt 180 
cacaatcagg ggagaaaatc tggggactgg ccccaccgac ctcataggct tgaccatttg 240 
tggacataat tgcctcctga cggcagaatg gatgtctgca agtaaaatag tatgtcgagt 300 
gggacaagcc aaaaatgaca aaggagacat tattgtcacc actaagtcag gtggcagagg 360 
aacctcaaca gtctctttca agctactcaa acctgagaaa ataggcattt tggatcagtc 420 
tgctgtgtgg gttgatgaaa tgaattatta tgatatgcgc actgacagga acaaaggaat 480 
tccgcccttg tccttacgtc ctgctaaccc gcttggcatg gagattgaac caagtacatt 540 
ttcgcagaag gacttagaaa tgctattcca tggaatgagt gctga 585 

<210> 199 
<211> 518 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 407214 . 10:20 00MAY19 
<400> 199 

gaaaagagtg ctccaccttc actgaagcca gcaatttggc cggctgttag cacttcattc 60 
atattcccta gtatgagtta attctttgtc attttatttc ttgtgattcc agaatctcat 120 
tttcagcttc tgccagagcc agtagcagtt gggaaagaag gctgtgctct ttgaagagag 180 
tcagccagcc tgaaagagca ggatggatct tgatgtggtt aacatgtttg tgattgcggg 240 
cggcacgctg gccatcccaa tcctggcatt tgtggcttca tttcttctgt ggccttcagc 300 
actgataaga atctattatt ggtactggcg gaggacattg ggcatgcaag tccgctatgt 360 
tcaccatgaa gactatcagt tctgttattc cttccggggc aggcctgggc acaaaccctc 420 
catcctcatg ctccacggat tctctggcca caaggatatg tggctcagtg tggtcaaagt 480 
tccttccaaa gaacctgcac ttggtctgcg tggacatg 518 

<210> 200 

<211> 1233 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 204626 . 1 :2000MAY19 
<400> 200 

gcagaaccag attctccagg tgtgaagtaa gcaatctatt ttcacttgct attaacagca 60 
aacgtacaac tttatctctt atttgacttc attatttggc tttcaaatta acaacatgaa 120 
aactatacag aaattagtct gtgccctata agacagagat ttagataatt tgttcaaaac 180 
atgagatgat attcagatat tctctcaaag gcaggactcc aacaccgcag gtggaaaacc 240 
aacaaaagga agaaaacaga aaggtggggg agataaggag tagccaggaa aaagaaaata 300 
gtgataaagg aagagccaaa aaggaggaaa gggaagatga aagaccgact tcaagaacta 360 
aagcagagaa caaaggaaat tgaactctct agagacagtc atgtatcaac tacagaaaca 420 
gaggaacaag gggtgtttct acagcaagct gttatttatg aaagagagcc tgtagctgag 480 
agacacctac atgaaatcca aaaactacag gaaagtatta acaatttggc agataatgtt 540 
caaaaatttg ggcagcaaca gaaaagtctg gtggcttcaa tgagaaggtt tagtctactt 600 
aagagagagt ctaccattac aaaggagata aaaattcagg cagaatacat caacagaagt 660 
ttgaatgatt tagttaaaga agttaaaaag tcagaggttg aaaatggtcc atcttcagtg 720 
gtcacaagga tacttaaatc tcagcatgct gcaatgttcc gccattttca gcaaatcatg 780 
tttatataca atgacacaat agcagcaaag caagagaagt gcaagacatt tattttacgt 840 
cagcttgaag ttgctggaaa agagatgtct gaagaagatg taaatgatat gcttcatcaa 900 
ggaaaatggg aagtttttaa tgaaagctta cttacagaaa tcaatatcac taaagcacaa 960 
ctttcagaga ttgaacagag acacaaggaa cttgttaatt tggagaacca aataaaggat 1020 
ttaagggatc ttttcattca gatatctctt ttagtagagg aacaaggaga gagcatcaac 1080 
aatattgaaa tgacagtgaa tagtacaaaa gagtatgtta acaatactaa agagaaattt 1140 
ggactagctg taaaatacaa aaaaagaaat ccttgcagag tactgtgttg ttggtgctgt 1200 
ccatgctgta gctcaaaata aagaagctat ttt ^ 1233 
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<210> 201 
<211> 1002 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 007401 . 1 : 2000MAY01 
<400> 201 

gcggccgctt tgccggtcat tcccgaagcc cggcaactga gggcggcccc ctttccttaa 60 
cagtctcctc gctacagatc gtctgctccc tcagcctcgc ccgagaccca cttccccagt 120 
ctcgcccggg tggaggtcga cgaggaggag acaagagtca cccttcctcc aggcggcgcc 180 
ggcccccctc acccgcgggt gtgtcctata aatggcgtcg gaaagcgaca ccgaggaatt 240 
ctatgatgcc cctgaagatg tgcacctagg gggcggctac cccgtggggt ctccaggaaa 300 
agttgggctt tcaacattca aggaaacaga gaacactgca tacaaagttg gaaatgagtc 360 
ccctgtacaa gaattgaaac aagatgtgtc taacaagatt atcgaaagta ttatcgagga 420 
gagtcagaaa gtactacagc ttgaagatga ctctttggat tccacaggaa aagaactctc 480 
tgatcaagct actgccagtc ctattgtggc tagaacagat ctgagcaata tacccggact 540 
gttagccata gatcaagtac taccggaaga atcccaaaag gcagagagtc agaatacatt 600 
tgaagagact gaattagaat ttaaaaaaat gctttccttc tgatgaaacc tgtgagaaac 660 
cagtagatga aaccacgaag ttaactcaaa caagttcaac tgagcagctt aatgtgcttg 720 
aaactgaaac agaagtattg aacaaggaag cagtggaagt caaaggaggt ggtgatgttt 780 
tagagcctgt gtcctcagac tccttatcta ctaaagattt tgccgctgtg gaagaagtgg 840 
cccctgccaa acccccaaga caccttactc cagagcctga tatagtggct agtacaaaga 900 
agcctgttcc agcacgccca cctcctccaa ctaatttccc acctcctaga cccccacctc 960 
cttctcgacc tgctccacca ccaagaaaaa ggaaaagcga at 1002 

<210> 202 
<211> 444 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 47 63 42 . 1 :2000MAY01 
<400> 202 

ggacggacgc gcgtctccgg tcccgtccgt aataacactg atccgatcca cgccggccgg 60 
cgatggagct catcaagtcc agggcgaccg tgtgcgcgct cctcctggcg ctgctcctgc 120 
tctcacacta cgacggcggg acgacgacga cgatggtggc ggaggcccgg gtgtgcatgg 180 
gcaagagcca gcaccactcg ttcccctgca tctccgaccg cctctgcagc aacgagtgcg 240 
tcaaggagga cggcgggtgg accgccggct actgccacct ccgctactgc aggtgccaga 300 
aggcgtgcta agcaaagctc ttcaaacacc cttggcttgc cagaactgaa ctctagtagt 3 60 
actaagtaac acccttggct agctgtgcac aacctacgta ccgtgcatgc atgtaatgtg 420 
gtgtcatgta acgtgacagc aata 444 

<210> 203 
<211> 1956 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID 3 No: LI : 1072759 . 1 :2000MAY01 
<400> 203 

tctagatcgc gagcgggccg cttttttttt cgcaacgggt ttgccgccag caacacaggt 60 
gtctgtgaac actaccccct aaaagtccaa taactgggaa caggaaactg actcatatta 120 
acacttggtc cgtcacttgt gacacggttg aatccgggca aggtcccacc aattatgggc 180 
catctgattc taataaatgc ggtggcatcg aacaaaagaa ccattgaaaa atttgacgac 240 
aggaggctgc tgagcatggg aaagggctcc ttcaagtatg ccttgggtct ctggataaac 300 
tgaaaggcta gagcgtgaac gtggtatcac cattgatatc tccttgtggg aaccttctga 360 
gaaccagcaa gtactatgtg actatcattg actgccccag cgacacagca gacctttact 420 
caacacaaca tgcattacag ggtacatctc aggctgactt gtgctgtcct gattgttgct 480 
gctggtgttg gtgaatttga cagctggtat ctccaagcaa tggggcggaa cccggagagc 540 
atgcccttct ggccgtaccc actggggtgt gaaaccaact aatagctcgg tgttaacaaa 600 
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agtggattcc actgagccaa cctaaaggcc cgaagaagat atgaggcaac ttgttcacgg 660 
aagtcagcga ggtacattaa gaaaaatggg cttacaaccc cgacacagta gcctttggtg 720 
.ccaatttctg gttggaatgg tgacaacatt ctggaagcaa ggtgctatca ttgccttggt 780 
ttcaaggaga tggaaagtca cccgtaaggg atggcaatgc caggtgagaa ccacgcttgc 840 
ttgcaggcta ctagactgca tcctaccacc aaactcgtac aactgacaag gcccttgtgc 900 
ccgtgcactc tccaggatgt ctacaaaatt ggtggtagtt ggtactgtgt cctgttgtgc 960 
gcgatgtgga gactggtgtt ctcataaccc gcgtatcggt aggtcacctg ttgctccgag 1020 
tcagacgtta acatctcgaa gtaaacatct gtcgaaatgg gccctgaaag gcttggagtg 1080 
aagctcttcc tgggggacaa tgtgggcttc aatgtcaacg aatgtgtctg tcaagggact 1140 
gttcgtcgtg gcaacgttgc tggtgacagc aaaaattgac ccaccaatgg gagcagacgt 1200 
gtggcttcac atgctcaggt gattattcct tgaaaccatc caggccaaat aatgcgccgg 1260 
ctatgctctc tgtattggat tgcctactct ggcfccactat tgcatgcagg ttttgctgag 1320 
tctgaaggaa aagattgatc gccgttctgg tataaaagtc tggaagtatg gccctaggat 1380 
aacgtgaaga tctggtgatg ctgcgcatgt gttgatatgg gttcctggca aggcccatgt 1440 
gctgttgaga gcttctctag actatccacc tttggggtcg ctttgctgtt cgtgatatgt 1500 
agactagaca gttgcggtgg gtgtctatct aatagtctag tggactaaga taggctgctg 1560 
gtagctggtc ataggtctac ccaagtctgc ccagaaagct cagaagggtt aattggaata 1620 
ttatccccta atacctgcct acccctactc ttaatcagtg ggtgaaagaa cggttctcag 1680 
aacctgtttg ttttcaattg gccaatttaa agttaagtag taaaagactg gtttaatgaa 1740 
taacaatgca tcgtaaaccc cttcagaagg aacaggagaa tgttttgtgg accactttgg 1800 
ttttctattt ttttgcgtgt ggcagtttca agttattagt ttttaaaatc agtacttttt 1860 
aatggaaaca acttgaccaa aaattgtgtc acagaaattt tgagacccat taaaaaaagt 1920 
taaatgcaga aaaaaaaaca aaaaggtggc gggccg 1956 

<210> 204 

<211> 1726 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 998857 . 1 : 2000FEB18 
<400> 204 

ctccatgcca ctctgacacc ctggcagtgc actgtgggta cacctcgacc ccgcctcacg 60 
gatgaggggt gctgagcggt cattggtgat ggtgggacga agcttgacag tggcaaaagg 120 
atttgtgccc ctcgggaaga gctcctgtgg tgggtactgc tctgaggaca tcagtttctt 180 
gacgtcagtg gcaactgctg tgctgcccat gctgctgcgg cggctgctgg gcaagggtgg 240 
aggtgcaggg ctgggggcac ggcttggcac ccggcttggg gtggagttgc ccggccggtc 300 
caggaggtca tcgaggctgt ggctgccccg gagtgggtag gacctggaag gcagctcgtt 360 
cccggggttc atgggtgtca tgggtgtcat gggggacatg gaggtcatgg aggtcatggg 420 
ggtcatgggg gtcatgggat tcacgggccc ctcctccaga gccttcacgt acgcctcggg 480 
gaaccaaccg ctcgcggacg agccctccag cttgccgtag agccagccgt tctgggcctc 540 
gggcaccaac acctccacca cgtccccagc ggagaagcgc agcagcgtgt ggttggcgcc 600 
ctcggagtgg gagaccaggg cgcggactct cctggcgccc ccgccgccgc ccgggcgctc 660 
gccaaaggag ttggagcgcg agctttgggc gctgccgctg tagagcgagg aggccgacgg 720 
cgtgcggggc agggagcgac ggtctggctc tagctgggac gcgggcctcg cgtcgggctc 780 
ggtgccgtag gagccggagc cgtgccggct gcggggggag ctgaactctc ccaggggcct 840 
cgggggcatg tccaggcagg. tgggcgtcag gcggcccgag gggtagggcg gccccagcgc 900 
ggggcccagc aggccggggg agtgggcgcg cgacgggctg cggctggcct cagactgctc 960 
cttccacagc agcacgcggt tctggagcat cccccgggcc cggccgaaga actgcaggaa 1020 
ggtgttggaa agtagcaggt gcttctctgc taggaagcga tagcgccgct tctcttccaa 1080 
ttcagccgcc cgctgactct cagacacgaa ggcctgcatc tgtgcgtgca gccggttcac 1140 
actctccttc atctcccgca cgttcttgtc tctcttgcgc tccatgcgcc acagctcaga 1200 
catgcacttc tccaggttgg ccgctcggtg gcggtactcg agctcatagt gctggcggct 1260 
gtctttgatg aactgcatgt ccagcttggt gttcttctcc atgtgctgca gcaggcctcc 1320 
atggaatgtc tgcaccacca cctccaggtc agagttcaag tgccgctggg tgtcagacat 1380 
ctgcaccaag atctccccca gaatctgtga ggtggggctc tgcagggcac gctccccaat 1440 
cttctggatg gcactgaagt agacctcggc cgcctcggac agagcgtgga aggcacgcag 1500 
gtagttgttg cccaggtaca ccaggttctc cagggcgggg ttaaactgct ccatgatgct 1560 
cttgtagatg gccatggtgg acctgtagaa ctggtccatc tcgggggcca tggaggggct 1620 
gtcccgggtc tgagcaggag gctgggagct ggtggcgatg gcacagccgg gagcagtggt 1680 
aggtagtccc tcaggtgccc acgactcagc tggcagcgag gaagcc 1726 

<210> 205 
<211> 1595 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_featiire 

<223> Incyte ID No: LG: 482261. 1 :2000FEB18 
<400> 205 

ctccacgttg cgatggatcc ttggacccac ttttgttaac tcttaaactt tgtgtctttg 60 
tctttatttc ttttctcatt ccctcgtctc caccgggaag o;ggagagcct gcgggtggtg 120 
tatcaggcag gttcccctac atctttggca cccaacacgg tctcctcgaa cccaggtgaa 180 
gttacacctg agcgtggtcg ttgtgaagaa cggtctgtcc aggaactccc gagaacgfcgt 240 
ggtcggcctt gcggtaagct tgtgcactcg gagcattcca gggacaccat gggacaatcc 300 
aaaagtaaac attctgcata tttacatttt attaagctcc tcttaaagag ggcaggaatt 360 
aaggctagca cagaaaattt gattactctg tttccaacag tagagcaata ttgtccttgg 420 
tttcctgaac atggtaccat ggacttcaaa gattgggaac aggtgggaat tgccttaaaa 480 
caagtttgta aggaaggaaa atttatcccc ctaacagcct ggtcaaactg ggctatagtt 540 
aaagcagcct cggaaccgtt tcaatcggaa aatgaggctt atcctccagc agaaagaatt 600 
tctgcagagg aaggtggtga tgctgctgaa ggaggagagg atagtgaaga agattttgag 660 
gaaaatacag acaaacctgg agatgagtta atttcttttg aggagcacgt gggaccttca 720 
gctgctccta aaatagagaa gccatatatg ccaagatgtt taaaacaaag aagggccttg 780 
aggagcagtc ggctcctcat tgggatcatc cggagtggcc gcctccaata aagcaatgta 840 
gcttggagcc ttggaggtct gaatctcaaa tttgccctgt ttcacgaatg aatgaattgt 900 
ggcctcagga accacaagcg catggtgtag caccagtaca acataaggct gcactgccat 960 
ctaatgttaa tgaatcgcca ttacag.tt.ta ttattcggca ggctagatta gccggagatc 1020 
ttgatgcctg gcagtttgca gtagttttgc aacccccacg acagcaaggt ggagcccatc 1080 
aagcggtatg ggaaccattt tcttttaagc tgctcaaaga tcttaaagca gctgttggtc 1140 
agtatggtcc caattcgcct ttcgtccgat cgctattgca atctgtggct cagaataagc 1200 
tattgactcc gtgtgattgg gagattttaa cgaaagttac actttcgccc tcccaatttc 1260 
ttcagtttaa gacttggtgg accgacgagg ctcaaaatca gatcgaaaaa accgtgctgc 1320 
taatcctgct attgccatta catttgaaca acttctagga atagggggtc aatggggaac 1380 
tgtaaacaac catcaggact tcgagatgat gccattgaac aaattcgcaa ttgctgtttg 1440 
agggcatggg agaaaattca ggatctggga actacttatc agtcttttaa ttctattaga 1500 
caaggctcaa aggaaccata tcctgatttc attgctcgcc ttcaagacgc agcacagaag 1560 
gctctcactg atgaaagtgc caggaaggcc gggtg 1595 

<210> 206 

<211> 2006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 480328 . 1:2000FEB18 
<400> 206 

gccacaagcc tcatatatca tgaccgaact catgctggtg aaaagtccta tgaatgtaaa 60 

gaatgtgggg aaacttttag tcatgcttca catcttatta ttcatgagag aattcatacc 120 

agtgataaac cctatgaatg taaaagatgt gggaaggcat ttcactgtgc ctcatatctt 180 

gttagacatg aaagcgttca tgctgatgga aatccctata tgtgtgaaga gtgtgggaaa 240 

gcttttaata gtagccatga acttagtata catcatagag ttcatactgg tgagaaaccc 300 

tttaaatgta acaaatgcag aaggtccttt aggcttagat ccatccttga agtacatcag 360 

agaattcata tttgatagag aaattgtggc atatacaggt agtaaagaaa gcacattgtg 420 

tcaatggatt tatcccactg aacaatttga atgaagacaa tgtggaatgg ctttgagata 480 

cagttcagtt tttattaggc atcaatttgt attggtgtga acttttcaaa atgtaaatca 540 

tatatgcagt ttctttggta tgacctgtgt taactccaga ttcagagaat ttataatgta 600 

tttttttaat aacataagaa aacttttctg atattcctct agttttggat tgccaaaata 660 

ttcatagtag aggaaaactg agaaaacgta ggaaaccatt ctttatactt gcatgtaaac 720 

atcttccagc aacaatcatt tattgttaaa cctggaaaaa gccattacac aggaaacctc 780 

tgttgatata agcgatgatg ggaaaggcta tttagccact ctgcatattt aatgctcatg 840 

atcaacatct gcttcctata gctgggatga tatgggggaa gatatttatc ccttctgggg 900 

atcatatata aatgatgata gtagtaccta cagtatagtg ctgttagaat tacatgagtt 960 

agatgtggag gtcagagtgg aagcaggtgt gagagggtcc cgcagaagaa aacatggctg 1020 

ccaaagtgtt tgagtccatc ggcaagtttg ggcctggcct tagctgttgc aggaggcatg 1080 

gtgaactctg ccttatgtaa tgtggatgct gggcacagag ctgccatctt tgaccaattc 1140 

cgtggagtac agaacattgt ggtaggggaa gggactcact ttctcatccc atgtgtacaa 1200 

aaaccaatta tctttgactg ctgttctcaa ccacgtagtg cgccagtcat cactggtagc 1260 

aaagatttac agaatgtcaa catcacactg tgcatcctct tccggcccat cactagccag 1320 
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cttcctcgca 
accacggaga 
agggagctgg 
ctcattctgg 
tggcccccca 
ctgagggtga 
gccagagcga 
ctcggaacat 
gccccagcct 
ccacctttcg 
agtaaaggta 
aaaaaattat 



tcttcaccag 
tcctcaagtc 
tctccagcca 
acgacgtgtc 
ggaagcagag 
ctccaaggca 
gctgtgcaag 
cacct'gcctg 
acctgcacct 
ccctcacccc 
aaatcacttc 
gggaaactac 



cattggagag 
actggtggct 
ggtgagcaac 
tttgacacat 
agcgccagat 
get gage tga 
ctggaagctg 
ccggcagggc 
ccgtgaggca 
agaaatcaet 
agatctctaa 
atgaaa 



gactacgatg 
cgctttgatg 
aaccttatgg 
ctgaccttct 
ttgtggtgga 
tcgccaactc 
cagaagacat 
agtccgtgct 
actgggccac 
gaaatttcat 
aaaacaaaga 



agtgtgtgct 
ctggagaact 
agtgagcagc 
tgaaggagtt 
aaaggeggee 
actggccact 
tgcataccag 
cctccagctg 
agccccgatg 
gattggctta 
attacatgag 



geegttcatt 
aatcacccag 
cacctttggg 
aacagacagg 
atcatctctg 
gcaggggacg 
ctctcatgct 
ccctagtgag 
attegtaaca 
aagtgaagga 
ttagtacatg 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2006 



<210> 207 
<211> 984 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG:311197.1:2000MAY19 



<400> 207 

gctctccacg 

gecaaatttg 

atgaaaaaca 

ttcccacatt 

gctgtcatcc 

caatgaagag 

agtgatgctc 

ttggtgggaa 

ggtccaggca 

gecgaatttg 

cagtgatatc 

gtctggtgac 

tgaagatatt 

tttatacagt 

caccatgact 

cagccaaccc 

tgatacagee 



tgcgcgactg 
atatctggag 
aaaeaaaaaa 
ccaatctatg 
actgacttta 
cagcagaaag 
aagttagtag 
cagaagcatt 
gatgeaaage 
aagatggtga 
tgeaaaatec 
tattttaaga 
ctaaacctgg 
aaaaqcatga 
tggttcagtg 
ccccagtccc 
aagctcaatg 



cgaggctgga 
cacctggaac 
gaaattttgc 
tetgectttg 
catttgette 
acgtcacact 
aac agate aa 
gctggcttct 
ttctcttcac 
ggttgcgagt 
tgaatattag 
agaagaagaa 
agagttctcc 
cccctatata 
acagcccttt 
cagaagcact 
cagg 



cgctacgggc 
ettgeagagt 
tetgeaagtc 
tgttcagcag 
ctgggagctt 
gagagtatct 
tatatcccaa 
gaaaacccac 
ccctcagcat 
cagcttctca 
aagatcagaa 
aaaagaeaaa 
aa cage t tea 
tgaccccatc 
gaeggaacaa 
tgcggatatg 



tcctggaaag 
gttttgaaga 
tactgacaag 
acaccagcat 
gtggtccgcg 
ggagaccttc 
gactggtcag 
tggaccctgg 
aaaatgctgc 
gctgtggttt 
gagctttcct 
aataataagg 
ggttcatcag 
aatggaacac 
aactgeagea 
taccagcctc 



gagttaagat 60 
ggaggttaca 120 
tcccgtttcc 180 
ttgccacaat 240 
ttgaccatcc 300 
acgttggagg 360 
actttgetet 420 
acaaatatgg 480 
gccttcgtct 540 
ttaaagctgt 600 
tgttaaagcc 660 
aacccataat 720 
taagtcctgg 780 
cagcatcatc 840 
tcctcgcatt 900 
ggtctctggt 960 
984 



<210> 208 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1054883 . 1 :2000MAY19 



<220> 

<221> unsure 
<222> 19 
<223> a, t, c, 



g, or other 



<400> 208 

geagagtgee 

tccacgcatg 

acttatacag 

tatcaggtca 

gaatgtatgg 

ageccgagaa 

gtttttgaaa 

gaagaaaaag 

agtaaaaagc 

aataaatatg 



cctgaagang 
gecgaagcat 
taagggcttc 
ccgaagtagg 
aaacattaaa 
atgaccgacc 
taagaactgg 
aaatccccaa 
tagaaatagg 
tttgtcatga 



ccgacagccc 
gaggaagctc 
aactaacctt 
acagcaaatt 
agaaacttgt 
tgatgcgtgt 
ecttttgeta 
geaaataact 
atgtggttct 
gtcag 



ctgctccagt 
ategttagat 
actcaaaatg 
aaaacaatct 
ttgtataatg 
tataacccat 
ggtgatacaa 
ttaagatttg 
cttaactgag 



cacacccgga 
tcatttttct 
gggactgttc 
ttctgttcta 
ccactcagta 
ctgagcccgc 
gtaaaataat 
atgcttgtgc 
aaaggagcta 



agctgactgg 60 
taaattttgg 120 
ccagtgtatt 180 
tagttattat 240 
caaggtatgt 300 
tgcaaccacc 360 
aaetagaaca 420 
agecattaat 480 
aagagtagaa 540 
565 
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<210> 209 
<211> 567 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc„feature 

<223> Incyte ID No: LG:399395 . 1 :2000MAY19 
<400> 209 

acagttcaca cttactactg ttgtagttat aaattattca cttgttttaa ataagcacat 60 
atatgatggc tgagtgagaa tgatctagaa ctatgccagg gattagataa aaacacagag 120 
taaaagttgg ccatatagta acatttttga tgtccaaatt ggagaggaat ccctcaaatg 180 
ttcaaaaata taggaatggg tcgggtgtgg tggctcatgc ctgtaatccc agcactttgg 240 
gaggccaagg caggaagatc acttgagccc aggagttcaa gaccagcctg ggcaacacag 300 
tgagaccctg actcaaaaaa taaaaagaaa agacaaaaat acaagaatgg ctaagcaaac 360 
tagtgtacat caacctggag gaatattata tagtctttta aaataacaaa tatgtagatt 420 
gttaatacat gggaaaaggt atatgaaata ggtggaacca agcagaactg agtcccaagg 480 
tgggtgggaa agaggctgcc atgctggctg aggtacctga gaacaatgaa aacaagtcat 540 
attaagacca atatcaatat cccactg 567 

<210> 210 
<211> 971 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:380497 . 2 :2000MAY19 
<400> 210 

tcgccccgtc cgctgcagtc cgccggcgag ggagttacgc acgtcctgat tctcctggag 60 

tctccagccc gcccagtggc cgcagtcacc caggtccaga ggcggcggta tcacaggctc 120 

tccgacatgt ctatgctggc tgaacgtcgg cggaagcaga agtgggctgt ggatcctcag 180 

aacactgcct ggagtaatga cgattccaag tttggccagc ggatgctaga gaagatgggg 240 

tggtctaaag gaaagggttt aggggctcag gagcaaggag ccacagatca tattaaagtt 300 

caagtgaaaa ataaccacct gggactcgga gctaccatca ataatgaaga caactggatt 360 

gcccatcagg atgattttaa ccagcttctg gccgaactga acacttgcca tgggcaggaa 420 

accacagatt cctcggacaa gaaggaaaag aaatctttta gccttgagga aaagtccaaa 480 

atctccaaaa accgtgttca ctatatgaaa ttcacaaaag ggcgatgcca gtccctccac 540 

tccagaggag aacgaaacca cgacaaccag cgccttcacc atccaggagt actttgccaa 600 

gcggatggca gcactgaaga acaagcccca ggttccagtt ccagggtctg acatttctga 660 

gacgcaggtg gaacgtaaaa gggggaagaa aagaaataaa gaggccacag gtaaagatgt 720 

ggaaagttac ctccagccta aggccaagag gcacacggag ggaaagcccg agagggccga 780 

ggcccaggag cgagtggcca agaagaagag cgcgccagca gaagagcagc tcagaggccc 840 

ctgctgggac cagagttcca aggcctctgc tcaggatgca ggggaccatg tgcagccgcc 900 

tgagggccgg gacttcaccc tgaaagccaa aaagaggaga ggcgagaaaa agctgcaaaa 960 

accagtagag a 971 

<210> 211 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 272913 . 22 :2000MAY01 
<400> 211 

agcgcgacag cggcagcacc aagggaacgg aaaatggcgc ctcacggccc gggtagtctt 60 

acgaccctgg tgccctgggc tgccgccctg ctcctcgctc tgggcgtgga aagggctctg 120 

gcgctacccg agatatgcac ccaatgtcca gggagcgtgc aaaatttgtc aaaagtggcc 180 

ttttattgta aaacgacacg agagctaatg ctgcatgccc gttgctgcct gaatcagaag 240 

gaccctggtc caaactttca tcaggcacat accactgtca tcatagacct gcaagcaaac 300 

cccctcaaag gtgacttggc caacaccttc cgtggcttta ctcagctcca gactctgata 360 

ctgccacaac atgtcaactg ttctggagga attaatgcct ggaatactat cacctcttat 420 

atagacaacc aaatctgtca agggcaaaag aacctttgca ataacactgg ggacccagaa 480 
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atgtgt 486 

<210> 212 
<211> 115 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1040582 . 1 .orf3 :2000FEB18 
<400> 212 



lie 


Leu 


Ala 


Thr 


Leu 


Ser 


Ser Pro Gin Pro 


Val 


Leu 


He Ala Lys 


1 








5 




10 






15 


Gly 


Thr 


Met 


Thr 


Ala 


Ser 


Ser Val Leu Leu 


His 


Thr 


Gly Gin Lys 










20 




25 






30 


Met 


Pro 


Leu 


He Gly 


Leu 


Gly Thr Trp Lys 


Ser 


Glu 


Pro Gly Gin 










35 




40 






45 


Val 


Lys 


Ala 


Ala 


He 


Lys 


Tyr Ala Leu Ser 


Val 


Gly 


Tyr Arg His 










50 




55 






60 


lie 


Asp 


Cys 


Ala 


Ser 


Val 


Tyr Gly Asn Glu 


Thr 


Glu 


He Gly Glu 










65 




70 






75 


Ala 


Leu 


Lys 


Glu 


Ser 


Val 


Gly Ser Gly Lys 


Ala 


Val 


Pro Arg Glu 










80 




85 






90 


Giu 


Leu 


Phe 


Val 


Thr 


Ser 


Lys Leu Trp Asn 


Thr 


Lys 


His His Pro 










95 




100 






105 


GlU 


Asp 


Val 


Glu 


Pro 


Ala 


Leu Arg Lys Thr 
















110 




115 









<210> 213 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 453570 .JL .orf 3 :2000FEB18 
<400> 213 



Cys 


Thr 


Leu 


Leu Arg 


Leu He Pro Ser 


Ala 


Ser 


His 


Phe Lys Arg 


1 






5 




10 






15 


Phe 


Asp 


Arg 


Val Arg 


Arg Phe Ala Pro 


Ala 


Ala 


Met 


Ala Thr Ser 








20 




25 






30 


Ser 


Gly 


Pro 


Lys Glu 


Ala Pro Ala Asn 


Asn 


Pro 


Gly 


Leu Gin Thr 






35 




40 




45 


Glu 


Val 


Asp 


Pro Ala 


Thr Lys Gly Tyr 


Phe 


Leu 


Gin 


Gin Thr Met 








50 




55 






60 


Leu 


Arg 


Val 


Lys Asp 


Pro Lys Val Ser 


Leu 


Asp 


Phe 


Tyr Ser Arg 








65 




70 






75 


Val 


Met 


Gly 


Met Ser 


Leu Leu Lys Arg 


Leu 


Asp 


Phe 


Glu Glu Met 








80 




85 






90 


Lys 


Phe 


Ser 


Leu Tyr 


Phe Leu Gly Tyr 


GlU 


Asp 


Val 


Thr Leu Ala 








95 




100 






105 


Pro 


Asp 


Asp 


His He 


Lys Arg Thr Glu 


Trp 


Thr 


Phe 


Arg Gin Lys 








110 




115 






120 


Ala 


Thr 


Leu 


Glu Leu 


Thr His Asn Trp 


Gly 


Thr 


Glu 


Asn Asp Pro 








125 




130 






135 


Glu 


Phe 


Lys 


Gly Tyr 


His Asn Gly Asn 


Ser 


Asp 


Pro 


Arg Gly Phe 








140 




145 






150 


Gly 


His 


He 


Gly Val 


Thr Val Asp Asp 


Val 


His 












155 


160 









<210> 214 
<211> 332 
<212> PRT 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 408751 .3 .orf 2 :2000FEB18 
<400> 214 

Arg Asp Pro Gly Trp Gin He Arg Asp Arg Ala Gly Leu Ala Trp 
15 10 15 

Asn Met Leu Ala Asn Ser Ala Ser Val Arg He Leu He Lys Gly 
20 25 30 

Gly Lys Val Val Asn Asp Asp Cys Thr His Glu Ala Asp Val Tyr 
35 ~ 40 45 

He Glu Asn Gly He He Gin Gin Val Gly Arg Glu Leu Met He 
50 55 60 

Pro Gly Gly Ala Lys Val He Asp Ala Thr Gly Lys Leu Val He 
65 70 75 

Pro Gly Gly He Asp Thr Ser Thr His Phe His Gin Thr Phe Met 
80 85 90 

Asn Ala Thr Cys Val Asp Asp Phe Tyr His Gly Thr Lys Ala Ala 
95 100 105 

Leu Val Gly Gly Thr Thr Met He He Gly His Val Leu Pro Asp 

110 115 120 

Lys Glu Thr Ser Leu Val Asp Ala Tyr Glu Lys Cys Arg Gly Leu 

125 130 135 

Ala Asp Pro Lys Val Cys Cys Asp Tyr Ala Leu His Val Gly lie 

140 145 150 

Thr Trp Trp Ala Pro Lys Val Lys Ala Glu Met Glu Thr Leu Val 

155 160 165 

Arg Glu Lys Gly Val Asn Ser Phe Gin Met Phe Met Thr Tyr Lys 

170 175 180 

Asp Leu Tyr Met Leu Arg Asp Ser Glu Leu Tyr Gin Val Leu His 

185 190 195 

Ala Cys Lys Asp He Gly Ala He Ala Arg Val His Ala Glu Asn 

200 205 210 

Gly Glu Leu Val Ala Glu Gly Ala Lys Glu Ala Leu Asp Leu Gly 

215 220 225 

He Thr Gly Pro Glu Gly He Glu He Ser Arg Pro Glu Glu Leu 

230 _ 235 240 

Glu Ala Glu Ala Thr His Arg Val lie Thr He Ala Asn Arg Thr 

245 250 255 

His Cys Pro He Tyr Leu Val Asn Val Ser Ser He Ser Ala Gly 

260 265 270 

Asp Val He Ala Ala Ala Lys Met Gin Gly Lys Val Val Leu Ala 

275 280 285 

Glu Thr Thr Thr Ala His Ala Thr Leu Thr Gly Leu His Tyr Tyr 

290 295 300 

His Gin Asp Trp Ser His Ala Ala Ala Tyr Val Thr Val Pro Pro 

305 310 315 

Leu Arg Leu Asp Thr Asn Thr Ser Thr Tyr Leu Met Ser Leu Leu 

320 325 330 

Ala Lys 5 



<210> 215 
<211> 274 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 090574 . 1 .orf 3 :2000FEB01 
<400> 215 

Ala Arg Asn Ser Ala Arg Val Pro Pro Arg Gly Thr Met Ser Arg 
15 10 15 

Leu Ser Trp Gly Tyr Arg Glu His Asn Gly Pro He His Trp Lys 
20 25 30 

Glu Phe Phe Pro He Ala Asp Gly Asp Gin Gin Ser Pro He Glu 
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35 








40 






45 


lie 


Lys 


Thr 


Lys 


Glu 
50 


Val 


Lys 


Tyr Asp 


Ser 
55 


Ser 


Leu Arg Pro 


Leu 
60 


ser 


He 


Lys 


Tyr 


Asp 


Pro 


Ser 


Ser Ala 


Lys 


He 


He Ser Asn 


Ser 






65 








70 






75 


Gly 


His 


Ser 


Phe 


Asn 
80 


Val 


Asp 


Phe Asp 


Asp 
85 


Thr 


Glu Asn Lys 


Ser 
90 


Val 


Leu 


Arg 


Gly 


Gly 
95 


Pro 


Leu 


Thr Gly 


Ser 
100 


Tyr 


Arg Leu Arg 


Gin 
105 


Val 


His 


Leu 


His 


Trp 
110 


Gly 


Ser 


Ala Asp 


Asp 
115 


His 


Gly Ser Glu 


His 
120 


He 


Val 


Asp 


Gly 


Val 


Ser 


Tyr 


Ala Ala 


Glu 


Leu 


His Val Val 


His 










125 






130 






135 


Trp 


Asn 


Ser 


Asp 


Lys 


Tyr 


Pro 


Ser Phe 


Val 


Glu 


Ala Ala His 


GlU 








140 








145 






150 


Pro 


Asp 


Gly 


Leu 


Ala 


Val 


Leu 


Gly Val 


Phe 


Leu 


Gin He Gly 


Glu 






155 






160 




165 


Pro 


Asn 


Ser 


Gin 


Leu 


Glh 


Lys 


He Thr 


Asp 


Thr 


Leu Asp Ser 


He 










170 






175 






180 


Lys 


Glu 


Lys 


Gly 


Lys 
185 


Gin 


Thr 


Arg Phe 


Thr 
190 


Asn 


Phe Asp Leu 


Leu 
195 


Ser 


Leu 


Leu 


Pro 


Pro 
200 


Ser 


Trp 


Asp Tyr 


Trp 
205 


Thr 


Tyr Pro Gly 


Ser 
210 


Leu 


Thr 


Val 


Pro 


Pro 


Leu 


Leu 


Glu Ser 


Val 


Thr 


Trp He Val 


Leu 










215 








220 




225 


Lys 


Gin 


Pro 


He 


Asn 


He 


Ser 


Ser Gin 


Gin 


Leu 


Ala Lys Phe 


Arg 








230 








235 




240 


Ser 


Leu 


Leu 


Cys 


Thr 


Ala 


Glu 


Gly Glu 


Ala 


Ala 


Ala Phe Leu 


Val 








245 






250 






255 


His 


His 


Leu 


Ala 


Thr 


He 


Pro 


Cys Gly 


Ser 


Ser 


Ser Ala Thr 


Cys 










260 






265 






270 


Ser 


Gly 


His 


Cys 



















<210> 216 
<211> 182 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 22 9932 .2 . orf 1:2000FEB01 
<400> 216 



Lys 


Leu 


Pro 


Leu 


Pro 


Pro 


Gly 


Ala Phe 


Ser 


Gly 


Leu Trp Lys 


Asn 


1 








5 








10 






15 


Gin 


Glu 


Ala 


Phe 


Lys 


His 


Leu 


Tyr Phe 


Glu 


Lys 


Phe Pro Gly 


Tyr 










20 








25 






30 


Tyr 


Asp 


Thr 


Met 


Asp 


Ala 


Gly 


Tyr Met 


Asp 


Glu 


Glu Gly Tyr 


Leu 










35 








40 






45 


Tyr 


Val 


Met 


Ser 


Arg 


Val 


Asp 


Asp Val 


He 


Asn 


Val Ala Gly 


His 










50 








55 






60 


Arg 


He 


Ser 


Ala 


Gly 


Ala 


He 


Glu Glu 


Ser 


He 


Leu Ser His 


Gly 










65 








70 






75 


Thr 


Val 


Ala 


Asp 


Cys 


Ala 


Val 


Val Gly 


Lys 


Glu 


Asp Pro Leu 


Lys 










80 








85 






90 


Gly 


His 


Val 


Pro 


Leu 


Ala 


Leu 


Cys Val 


Leu 


Arg 


Lys Asp He 


Asn 








95 






100 






105 


Ala 


Thr 


Glu 


Glu 


Gin 


Val 


Leu 


Glu Glu 


He 


Val 


Lys His Val 


Arg 










110 








115 




120 


Gin 


Asn 


He 


Gly 


Pro 


Val 


Ala 


Ala Phe 


Arg 


Asn 


Ala Val Phe 


Val 








125 








130 






135 


Lys 


Gin 


Leu 


Pro 


Lys 


Thr 


Arg 


Ser Gly 


Lys 


He 


Pro Arg Ser 


Ala 










140 








145 






150 


Leu 


Ser 


Ala 


He 


Val 


Asn 


Gly 


Lys Pro 


Tyr 


Lys 


He Thr Ser 


Thr 










155 






160 






165 


He 


Glu 


Asp 


Pro 


Ser 


He 


Phe 


Gly His 


val 


Glu 


Glu Met Leu 


Lys 
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170 175 180 

Gin Ala 



<210> 217 
<211> 359 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI:332176'.l.orf2:2000FEB01 
<400> 217 



Leu 


lie 


Lys 


Arg 


Ser 


Lys Gly Lys Glu 


Arg 


Pro 


Phe Val Leu 


Thr 


1 








5 




10 






15 


Arg 


Ser 


Phe 


Phe 


Ala 
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Leu 




595 




600 


Val 


Leu 


Asn 


Glu 


Ala Arg 


Val 




610 






615 


Leu 


Phe 


Ser 


Asn 


Pro Leu 


Asp 




625 








630 


Met 


Val 


Glu 


Gly 


Ser Gly 


Leu 




640 






645 


Val 


Pro 


Thr 


Leu 


Gly Pro 


Lys 




655 








660 


He 


Leu 


Pro 


Ser 


Arg Ser 


Gly 




670 








675 


Ser 


Cys 


Asn 


Lys 


Phe Pro 


Ala 




685 








690 


Val 


Ala 


Glu 









700 



.orf2:2000MAY01 



111/228 



WO 01/62927 



PCT/US01/06059 



<400> 222 



Cys 


Thr 


lie Gly 


Pro 


Ala 


Ser 


Arg 


Ser 


Val 


Glu 


Thr Leu Lys 


Glu 


1 






5 










10 






15 


Met 


lie 


Lys Ser 


Gly 


Met 


Asn 


Val 


Ala 


Arg 


Leu 


Asn Phe Ser 


His 








20 










25 






30 


Gly 


Thr 


His Glu 


Tyr 


His 


Ala 


Glu 


Thr 


lie 


Lys 


Asn Val Arg 


Thr 








35 










40 






45 


Ala 


Thr 


Glu Ser 


Phe 


Ala 


Ser 


Asp 


Pro 


He 


Leu 


Tyr Arg Pro 


Val 








50 








55 






60 


Ala 


Val 


Ala Leu 


Asp 


Thr 


Lys 


Gly 


Pro 


Glu 


He 


Arg Thr Gly 


Leu 








65 










70 






75 


lie 


Lys 


Gly Ser 


Gly 


Thr 


Ala 


Glu 


Val 


Glu 


Leu 


Lys Lys Ala 


Ala 








80 










85 






90 


Thr 


Leu 


Lys He 


Thr 


Leu 


Asp 


Asn 


Ala 


Tyr 


Met 


Glu Lys Cys 


Asp 








95 










100 






105 


Glu 


Asn 


He Leu 


Trp 


Leu 


Asp 


Tyr 


Lys 


Asn 


He 


Cys Lys Val 


Val 








110 










115 






120 


Glu 


Val 


Ser Arg 


Leu 


His 


His 


Ala 


Val 


Trp 


Arg 


Asn Ser Gin 


Arg 








125 










130 






135 


Gly 


Leu 


Ser Ser 


Gly 


Gly 


Cys 


Ala 


His 


Ala 


Ala 


Pro Asp Ser 


Ser 



140 145 150 



<210> 223 
<211> 234 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1190263 . 1 .orf 2 : 2000MAY01 
<400> 223 



Ala 


Ala 


Gly Ser 


Leu 


Phe Pro 


Gly Leu 


Leu 


He Phe Ser Met 


He 


1 




5 






10 




15 


Leu 


Phe 


He Phe 


Leu 


Leu Gly 


Tyr Ala 


Trp 


Phe Ser Ser His 


Thr 








20 






25 




30 


Ser 


Pro 


Leu Tyr 


Trp 


Asp Cys 


Leu Leu 


Met 


Arg Gly His Glu 


lie 








35 






40 




45 


Thr 


Glu 


Gin Pro 


Met 


Lys Ala 


Glu Arg 


Ala 


Gly Ser He Met 


Val 








50 




55 




60 


Lys 


Glu 


Ala He 


Ser 


Phe Leu 


Glu Arg 


His 


Ser Lys Glu Thr 


Phe 








65 






70 


75 


Leu 


Leu 


Phe Phe 


Ser 


Phe Leu 


His Val 


His 


Thr Pro Leu Pro 


Thr 








80 






85 




90 


Thr 


Asp 


Asp Phe 


Thr 


Gly Thr 


Ser Lys 


His 


Gly Leu Tyr Gly 


Asp 








95 






100 




105 


Asn 


Val 


Asp Glu 


Met 


Asp Ser 


Met Val 


Gly 


Lys He Leu Asp 


Ala 








110 






115 


120 


He 


Asp 


Asp Phe 


Gly 


Leu Arg 


Asn Asn 


Thr 


Leu Val Tyr Phe 


Thr 








125 






130 




135 


Ser 


Asp 


His Gly 


Gly 


His Leu 


Glu Ala 


Arg 


Arg Gly His Ala 


Gin 








140 






145 




150 


Leu 


Gly 


Gly Trp 


Asn 


Gly He 


Tyr Lys 


Gly 


Gly Lys Gly Met 


Gly 








155 






160 




165 


Gly 


Trp 


Glu Gly 


Gly 


He Arg 


Val Pro 


Gly 


He Val Arg Trp 


Pro 








170 






175 




180 


Gly 


Lys 


Val Pro 


Ala 


Gly Arg 


Leu He 


Lys 


Glu Pro Thr Ser 


Leu 








185 






190 




195 


Met 


Asp 


He Leu 


Pro 


Thr Val 


Ala Ser 


Val 


Ser Gly Gly Ser 


Leu 








200 






205 


210 


Pro 


Gin 


Asp Arg 


Val 


He Asp 


Gly Arg 


Asp 


Leu Met Pro Leu 


Leu 








215 






220 




225 


Ala 


Gly 


Gin Arg 


Gin 


Ala Leu 


Gly Ala 









230 
<210> 224 



112/228 



WO 01/62927 



PCT/US01/06059 



<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:270916. 2 .orf 2 :2000FEB18 
<400> 224 



Lys Phe 


Ser Ser 


Asn 


Gly Leu 


Trp Pro Ser 


Thr 


Trp Leu 


Leu Thr 


1 




5 




10 






15 


Thr Arg 


Thr Leu 


Pro 


Met He 


Ser Arg Cys 


Ser 


Pro Met 


His Leu 






20 




25 






30 


Leu Thr 


He Ser 


Ser 


Ala Phe 


Cys Leu Leu 


Cys 


Pro Pro 


Pro Arg 






35 




40 






45 


Met Pro 


Phe Gin 


Lys 


Cys Leu 


Leu Leu Ser 


Arg 


Tyr Arg 


Ser Arg 






50 




55 






60 


Gly Val 


Leu Val 


Ala 


Val He 


Trp Gly Thr 


Thr 


Glu Ala 


Ser Gly 






65 




70 






75 


He Ser 


Gly Leu 


He 


Thr Ala 


Leu Trp Glu 


Phe 










80 




85 









<210> 225 
<211> 173 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 999414 .3 .orf 2 :2000FKB18 
<400> 225 



He Leu 


Thr 


Ser 


Val 


Ser 


Ser 


Ser 


Phe 


Trp 


Cys 


Pro 


Phe 


Phe Leu 


1 






5 










10 








15 


Ser Leu 


Leu 


Asp 


Ser 


Gin 


Leu 


His 


Ser 


Trp 


He 


Val 


Leu 


Gin Leu 








20 










25 








30 


Thr He 


He 


Lys 


Asn 


Val 


Glu 


He 


Ser 


Asn 


Leu 


Val 


Cys 


Asp Pro 








35 










40 








45 


Ser Gin 


Leu 


Leu 


Asn 


Leu 


Ala 


Cys 


Ser 


Asp 


Ser 


Val 


He 


Asp Ser 








50 










55 








60 


He Phe 


He 


Tyr 


Leu 


Asp 


Ser 


Thr 


He 


Phe 


Gly 


Phe 


Leu 


Pro He 








65 










70 








75 


Ser Gly 


He 


Leu 


Leu 


Ser 


Tyr 


Tyr 


Lys 


He 


Val 


Pro 


Ser 


He Leu 








80 










85 








90 


Arg He 


Ser 


Ser 


Ser 


Asp 


Gly 


Lys 


Tyr 


Lys 


Ala 


Phe 


Ser 


Thr Cys 








95 










100 








105 


Arg Ser 


His 


Leu 


Ala 


Val 


Val 


Cys 


Leu 


Phe 


Tyr 


Gly Thr 


Gly He 








110 










115 








120 


Gly Val 


Tyr 


Leu 


Thr 


Ser 


Ala 


Val 


Ala 


Pro 


Ala 


Pro Arg 


Ser Gly 








125 










130 








135 


Val Val 


Val 


Ser 


Val 


Met 


Tyr 


Thr 


Val 


Val 


Thr 


Pro 


Met 


Leu Asn 








140 










145 








150 


Pro Phe 


He 


Tyr Cys 


Leu 


Arg 


Lys 


Gin 


Gly 


His 


Ser 


Lys 


Arg Leu 








155 










160 








165 


Trp Arg 


Cys 


Ala Ala 


Glu 


Gin 


Ser 















170 

<210> 226 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:429446 . 1 .orf 2 :2000FEB18 
<400> 226 



113/228 



WO 01/62927 



PCT/US01/06059 



His Leu Val Ala Thr Val 

1 5 
Ser Arg He Lys Thr Val 
20 

Pro Gin Met Gin Ser Met 
35 

Arg Pro Ala Ser Pro Cys 
50 

Pro Pro Lys Glu He Ser 
65 

<210> 227 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: LI:057229. l.orf 1:2000FEB01 

<400> 227 



Gin Pro Ser 


Leu 


Pro 


Glu Phe Ser His Phe 


Gin Lys Thr Val Leu 


1 




5 


10 


15 


Leu Glu Ser 


Lys 


He 


Ala Arg Gin Phe He 


Leu Phe Tvr Phe He 




20 


25 


30 


Leu His He 


Phe 


Leu 


Arg Gin Ser Leu Ala 


Leu Phe Pro Arg Leu 






35 


40 


45 


Glu Cys Gly 


Gly Ala 


Val Leu Ala His Cys 


Asn Leu Cys Leu Leu 




50 


55 


60 


Gly Ser Ser 


Asp 


Ser 


Pro Ala Ser Ala Ser 




65 


70 





<210> 228 
<211> 117 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:351965 . 1 .orf 2 : 2000FEB01 
<400> 228 



Pro 


Thr 


Thr 


Ser Asn 


Arg Ala 


He Thr Leu 


Thr 


Ala Arg Pro Lys 


1 






5 


10 




15 


lie 


Pro 


Phe 


Leu Arg 


He Arg 


Glu Ala Lys 


Asn 


Pro Arg Ser Glu 








20 




25 




30 


Asn 


Met 


Arg 


Leu Ala 


Thr He 


Leu Glu Val 


Ala 


cys Arg His Phe 






35 




40 




45 


Gly 


Ser 


Gly 


Leu Pro 


Pro Ser 


Trp Glu Leu 


Trp 


Glu Gin Gly Pro 








50 




55 




60 


Pro 


Gly 


Asn 


Ser Ser 


Arg Tyr 


He Glu Phe 


Leu 


Asn Lys His Thr 






65 


70 




75 


Tyr 


He 


Lys 


Gly Thr 


Leu Arg 


Val Tyr Thr 


Lys 


Lys Phe Cys Met 








80 




85 




90 


Leu 


Val 


He 


Lys Ser 


Phe Glu 


Ser Lys Ser 


Cys 


Val Cys Val Tyr 








95 




100 




105 


Asp 


Phe 


Asp 


Ser Lys 


Ser Ser 


Val Asn Val 


Thr 


Val 




110 




115 







<210> 229 
<211> 294 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 068682 . l.orf 2 :2000FEB18 



Arg Gly Phe Ser Lys Val Phe Val Ser 
10 15 
Lys Leu Gin He Val Leu Gin Met Glu 
25 30 
Thr Lys He Tyr His Arg Pro Leu Asp 
40 45 
Ser Asp Val Asp Asp He Glu Gly Ala 
55 60 

Thr Ala 
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WO 01/62927 



PCT/US01/06059 



<400> 229 



Gin 


Arg 


Pro Met 


Ser 


Gly 


Ser 


Gly 


His 


Met 


Gly 


Val Arg Gly 


Cys 


1 






5 










10 






15 


Arg 


Cys 


Gin Ala 


Pro 


Trp 


Leu 


Arg 


Pro 


Glu 


lie 


Gly Val Arg 


Pro 






20 










25 






30 


Pro 


Pro 


Arg Ser 


Gin 


Ala 


Ala 


Ser 


Cys 


Pro 


Pro 


Cys Ala Leu 


Gly 








35 








40 






45 


Ala 


Thr 


Met Ser 


Gly 


Asp 


Lys 


Leu 


Leu 


Ser 


Glu 


Leu Gly Tyr 


Lys 








50 










55 






60 


Leu 


Gly 


Arg Thr 


lie 


Gly 


Glu 


Gly 


Ser 


Tyr 


Ser 


Lys Val Lys 


Val 








65 










70 






75 


Ala 


Thr 


Ser Lys 


Lys 


Tyr 


Lys 


Gly 


Thr 


Val 


Ala 


He Lys Val 


Val 








80 










85 






90 


Asp 


Arg 


Arg Arg 


Ala 


Pro 


Pro 


Asp 


Phe 


Val 


Asn 


Lys Phe Leu 


Pro 








95 










100 






105 


Arg 


Glu 


Leu Ser 


lie 


Leu 


Arg 


Gly 


Val 


Arg 


His 


Pro His He 


Val 








110 






115 






120 


His 


Val 


Phe Glu 


Phe 


lie 


Glu 


Val 


Cys 


Asn 


Gly 


Lys Leu Tyr 


He 








125 










13 0 






135 


Val 


Met 


Glu Ala 


Ala 


Ala 


Thr 


Asp 


Leu 


Leu 


Gin 


Ala Val Gin 


Arg 








140 








145 






150 


Asn 


Gly 


Arg lie 


Pro 


Gly 


Val 


Gin 


Ala 


Arg 


Asp 


Leu Phe Ala 


Gin 








155 










160 






165 


lie 


Ala 


Gly Ala 


Val 


Arg 


Tyr 


Leu 


His 


Asp 


His 


His Leu Val 


His 








170 










175 






180 


Arg 


Asp 


Leu Lys 


Cys 


Glu 


Asn 


Val 


Leu 


Leu 


Ser 


Pro Asp Glu 


Arg 








185 










190 






195 


Arg 


Val 


Lys Leu 


Thr 


Asp 


Phe 


Gly 


Phe 


Gly 


Arg 


Gin Ala His 


Gly 








200 










205 






210 


Tyr 


Pro 


Asp Leu 


Ser 


Thr 


Thr 


Tyr 


Cys 


Gly 


Ser 


Ala Ala Tyr 


Ala 






215 










220 






225 


Ser 


Pro 


Glu Val 


Leu 


Leu 


Gly 


He 


Pro 


'Tyr 


Asp 


Pro Lys Lys 


Tyr 








230 










235 






240 


Asp 


Val 


Trp Ser 


Met 


Gly 


Val 


Val 


Leu 


Tyr 


Val 


Met Val Thr 


Gly 




245 








250 






255 


Cys 


Met 


Pro Phe 


Asp 


Asp 


Ser 


Asp 


lie 


Ala 


Gly 


Leu Pro Arg 


Arg 






260 










265 






270 


Gin 


Lys 


Arg Gly 


Val 


Leu 


Tyr 


Pro 


Glu 


Gly 


Leu 


Glu Leu Ser 


Glu 






275 










280 






285 


Arg 


Cys 


Lys Ala 


Leu 


He 


Ala 


Glu 


Leu 











290 

<210> 230 
<211> 326 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:242665 . 1 .orf 1:2000FEB18 
<400> 230 



His 


His 


Leu 


He 


Ser 


Leu 


Tyr 


Phe 


Thr 


Asp 


Phe 


Pro He Ser 


Phe 


1 








5 








10 






15 


Phe 


Met 


Phe 


Tyr 


Ala 


Asn 


Phe 


Ser 


Arg 


Arg 


Thr 


Gly Pro Ala 


Pro 










20 










25 






30 


Pro 


Leu 


Arg 


Thr 


Thr 


Pro 


Arg 


Ala 


Trp 


Leu 


Arg 


Arg Glu Cys 


Gly 










35 










40 






45 


Ala 


Ser 


Thr 


Met 


Ser 


Ala 


Pro 


Gly 


Ser 


Pro 


Asp 


Gin Ala Tyr 


Asp 










50 








55 






60 


Phe 


Leu 


Leu 


Lys 


Phe 


Leu 


Leu 


Val 


Gly 


Asp 


Arg 


Asp Val Gly 


Lys 










65 










70 






75 


Ser 


Glu 


He 


Leu 


Glu 


Ser 


Leu 


Gin 


Asp 


Gly 


Ala 


Ala Glu Ser 


Pro 










80 








85 






90 


Tyr 


Ser 


His 


Leu 


Gly 


Gly 


He 


Asp 


Tyr 


Lys 


Thr 


Thr Thr He 


Leu 










95 










100 






105 


Leu 


Asp 


Gly 


Gin 


Arg 


Val 


Lys 


Leu 


Lys 


Leu 


Trp 


Asp Thr Ser 


Gly 
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110 








115 






120 


Gin 


Gly 


Arg Phe 


Cys 


Thr 


He 


Phe Arg 


Ser 


Tyr 


Ser Arg Gly 


Ala 




125 








130 






135 


Gin 


Gly 


Val He 


Leu 


Val 


Tyr 


Asp He 


Ala 


Asn 


Arg Trp Ser 


Phe 






140 








145 






150 


Glu 


Gly 


Met Asp 


Arg 


Trp 


He 


Lys Lys 


He 


Glu 


Glu His Ala 


Pro 






155 








160 






165 


Gly 


Val 


Pro Lys 


He 


Leu 


Val 


Gly Asn 


Arg 


Leu 


His Leu Ala 


Phe 




170 








175 






180 


Lys 


Arg 


Gin Val 


Pro 


Arg 


Glu 


Gin Ala 


Gin 


Ala 


Tyr Ala Glu 


Arg 




185 






190 






195 


Leu 


Gly 


Val Thr 


Phe 


Phe 


Glu 


Val Ser 


Pro 


Leu 


Cys Asn Phe 


Asn 






200 








205 






210 


He 


He 


Glu Ser 


Phe 
215 


Thr 


Glu 


Leu Ala 


Arg 
220 


He 


Val Leu Leu 


Arg 
225 


His 


Arg 


Met Asn 


Trp 


Leu 


Gly 


Arg Pro 


Ser 


Lys 


Val Leu Ser 


Leu 






230 






235 






240 


Gin 


Asp 


Leu Cys 


Cys 


Arg 


Thr 


He Val 


Ser 


Cys 


Thr Pro Val 


His 






245 








250 






255 


Leu 


Val 


Asp Lys 


Leu 


Pro 


Leu 


Pro Ser 


Thr 


Leu 


Arg Ser His 


Leu 






260 








265 






270 


Lys 


Ser 


Phe Ser 


Met 


Ala 


Lys 


Gly Leu 


Asn 


Ala 


Arg Met Met 


Arg 






275 








280 






285 


Gly 


Leu 


Ser Tyr 


Ser 


Leu 


Thr 


Thr Ser 


Ser 


Thr 


His Lys Arg 


Ser 




290 








295 






300 


Ser 


Leu 


Cys Lys 


Val 


Lys 


He 


Val Cys 


Pro 


Pro 


Gin Ser Pro 


Pro 






305 








310 






315 


Lys 


Asn 


Cys Thr 


Arg 
320 


Asn 


Ser 


Cys Lys 


He 
325 


Ser 







<210> 231 
<211> 182 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:241743 .1 .orf 1:2000PEB18 
<400> 231 



Lys 


Ser 


Gly 


Thr 


Pro 


Arg 


Arg 


Ala Leu 


Leu 


Leu Phe Leu Val 


Phe 


1 






5 








10 




15 


Lys 


He 


Arg 


Gly 


Ser 


Pro 


Val 


Ser His 


Leu 


Met Pro Arg Leu 


Lys 








20 








25 




30 


Glu 


Ser 


Arg 


Ser 


His 


Glu 


Ser 


Leu Leu 


Ser 


Pro Ser Ser Ala 


Val 








35 








40 




45 


Glu 


Ala 


Leu 


Asp 


Leu 


Ser 


Met 


Glu Glu 


Glu 


Val Val He Lys 


Pro 








50 








55 




60 


val 


His 


Ser 


Ser 


He 


Leu 


Gly 


Gin Asp 


Tyr 


Cys Phe Glu Val 


Thr 










65 






70 




75 


Thr 


Ser 


Ser 


Gly 


Ser 


Lys 


Cys 


Phe Ser 


Cys 


Arg Ser Ala Ala 


Glu 








80 




85 




90 


Arg 


Asp 


Lys 


Trp 


Met 


Glu 


Asn 


Leu Arg 


Arg 


Ala Val His Pro 


Asn 










95 








100 




105 


Lys 


Asp 


Asn 


Ser 


Arg 


Arg 


Val 


Glu His 


He 


Leu Lys Leu Trp 


Val 










110 








115 




120 


He 


GlU 


Ala 


Lys 


Asp 


Leu 


Pro 


Ala Lys 


Lys 


Lys Tyr Leu Cys 


Glu 










125 








130 




135 


Leu 


Cys 


Leu 


Asp 


Asp 


Val 


Leu 


Tyr Ala 


Arg 


Thr Thr Gly Lys 


Leu 










140 








145 




150 


Lys 


Thr 


Asp 


Asn 


Val 


Phe 


Trp 


Gly Glu 


His 


Phe Glu Phe His 


Asn 








155 








160 




165 


Leu 


Pro 


Pro 


Leu 


Arg 


Thr 


Val 


Thr Val 


His 


Leu Tyr Arg Glu 


Thr 



170 175 180 

Asp Lys 
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<210> 232 
<211> 358 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 034212 .1 .orf 1:200 0FEB01 
<220> 

<221> unsure 
<222> 25 

<223> unknown or other 
<400> 232 



Asn 


Ser 


Ser 


Leu Thr 


Gin 


Leu 


Arg 


Arg 


Leu 


Glu 


Glu 


Leu Asp Leu 


1 






5 








10 






15 


Gly 


Asn 


Asn 


Glu He 


Tyr 


Asn 


Leu 


Pro 


Xaa 


Ser 


He 


Gly Ala Leu 






20 








25 






30 


Leu 


His 


Leu 


Lys Asp 


Leu 


Trp 


Leu 


Asp 


Gly 


Asn 


Gin 


Leu Ser Glu 








35 








40 






45 


Leu 


Pro 


Gin 


Glu lie 


Gly 


Asn 


Leu 


Lys 


Asn 


Leu 


Leu 


Cys Leu Asp 








50 










55 






60 


Val 


Ser 


Glu 


Asn Arg 


Leu 


Glu 


Arg 


Leu 


Pro 


Glu 


Glu 


He Ser Gly 








65 








70 






75 


Leu 


Thr 


Ser 


Leu Thr 


Asp 


Leu 


Val 


He 


Ser 


Gin 


Asn 


Leu Leu Glu 








80 








85 






90 


Thr 


lie 


Pro 


Asp Gly 


He 


Gly 


Lys 


Leu 


Lys 


Lys 


Leu 


Ser He Leu 








95 










100 






105 


Lys 


Val 


Asp 


Gin Asn 


Arg 


Leu 


Thr 


Gin 


Leu 


Pro 


Glu 


Ala Val Gly 




110 










115 






120 


Glu 


Cys 


Glu 


Ser Leu 


Thr 


Glu 


Leu 


Val 


Leu 


Thr 


Glu 


Asn Gin Leu 






125 










130 






135 


Leu 


Thr 


Leu 


Pro Lys 


Ser 


He 


Gly 


Lys 


Leu 


Lys 


Lys 


Leu Ser Asn 








140 










145 






150 


Leu 


Asn 


Ala 


Asp Arg 


Asn 


Lys 


Leu 


Val 


Ser 


Leu 


Pro 


Lys Glu He 








155 








160 






165 


Gly 


Gly 


Cys 


Cys Ser 


Leu 


Thr 


Val 


Phe 


cys 


Val 


Arg 


Asp Asn Arg 








170 










175 






180 


Leu 


Thr 


Arg 


He Pro 


Ala 


Glu 


Val 


Ser 


Gin 


Ala 


Thr 


Glu Leu His 






185 










190 






195 


Val 


Leu 


Asp 


Val Ala 


Gly 


Asn 


Arg 


Leu 


Leu 


His 


Leu 


Pro Leu Ser 








200 










205 






210 


Leu 


Thr 


Ala 


Leu Lys 


Leu 


Lys 


Ala 


Leu 


Trp 


Leu 


Ser 


Asp Asn Gin 








215 








220 






225 


Ser 


Gin 


Pro 


Leu Leu 


Thr 


Phe 


Gin 


Thr 


Asp 


Thr 


Asp 


Tyr Thr Thr 








230 










235 






240 


Gly 


Glu 


Lys 


He Leu 


Thr 


Cys 


Val 


Leu 


Leu 


Pro 


Gin 


Leu Pro Ser 






245 








250 






255 


Glu 


Pro 


Thr 


Cys Gin 


Glu 


Asn 


Leu 


Pro 


Arg 


Cys 


Gly 


Ala Leu Glu 








260 










265 


270 


Asn 


Leu 


Val 


Asn Asp 


Val 


Ser 


Asp 


Glu 


Ala 


Trp 


Asn 


Glu Arg Ala 








275 








280 






285 


Val 


Asn 


Arg 


Val Ser 


Ala 


He 


Arg 


Phe 


Val 


Glu 


Asp 


Glu Lys Asp 








290 










295 






300 


Glu 


Glu 


Asp 


Asn Glu 


Thr 


Arg 


Thr 


Leu 


Leu 


Arg 


Arg 


Ala Thr Pro 






305 








310 






315 


His 


Pro 


Gly 


Glu Leu 


Lys 


His 


Met 


Lys 


Lys 


Thr 


Val 


Glu Asn Leu 






320 








325 






330 


Arg 


Asn 


Asp 


Met Asn 


Ala 


Ala 


Lys 


Gly 


Leu 


Asp 


Ser 


Asn Lys Asn 








335 










340 






345 


Glu 


Val 


Asn 


His Ala 


He 


Asp 


Arg 


Val 


Thr 


Thr 


Ser 


Val 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 344886 . 1 . orf 1 : 2000MAY19 
<400> 233 



Glu 


Lys 


Met Gly 


Lys 


Gly Cys Lys Val 


Val 


Val 


Cys 


Gly Leu 


Leu 


1 






5 




10 








15 


Ser 


Val 


Gly Lys 


Thr 


Ala He Leu Glu 


Gin 


Leu 


■Leu 


Tyr Gly 


Asn 








20 




25 








30 


His 


Thr 


lie Gly 


Met. 


Glu Asp Cys Glu 


Thr 


Met 


Glu 


Asp Val 


Tyr 








35 




40 








45 


Met 


Ala 


Ser Val 


Glu 


Thr Asp Arg Gly 


Val 


Lys 


Glu 


Gin Leu 


His 








50 


55 








60 


Leu 


Tyr 


Asp Thr 


Arg 


Gly Leu Gin Glu 


Gly 


Val 


GlU 


Leu Pro 


Lys 








65 




70 








75 


His 


Tyr 


Phe Ser 


Phe 


Ala Asp Gly Phe 


Val 


Leu 


Val 


Tyr Ser 


Val 








80 


85 








90 


Asn 


Asn 


Leu Glu 


Ser 


Phe Gin Arg Val 


Glu 


Leu 


Leu 


Lys Lys 


Glu 








95 


100 






105 


lie 


Asp 


Lys Phe 


Lys 


Asp Lys Lys Glu 


Val 


Ala 


He 


Val Val 


Leu 








110 




115 








120 


Gly 


Asn 


Lys lie 


Asp 


Leu Ser Glu Gin 


Arg 


Gin 


Val 


Asp Ala 


Glu 






125 




130 








135 


Val 


Ala 


Gin Gin 


Trp 


Ala Lys Ser Glu 


Lys 


Val 


Arg 


Leu Trp 


Glu 








140 


145 








150 


Val 


Thr 


Val Thr 


Asp 


Arg Lys Thr Leu 


He 


Glu 


Pro 


Phe Thr 


Leu 








155 


160 








165 


Leu 


Ala 


Ser Lys 


Leu 


Ser Gin Pro Gin 


Ser 


Lys 


Ser 


Ser Phe 


Pro 






170 




175 








180 


Leu 


Pro 


Gly Arg 


Lys 


Asn Lys Gly Asn 


Ser 


Asn 


Ser 


Glu Asn 





185 190 

<210> 234 
<211> 222 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:228930 . 1 .orf 2 : 2000MAY19 
<400> 234 



Ala 


Gin 


Met 


Ala 


Gly 


Ala 


Gin 


Pro Gly 


Val 


His 


Ala 


Leu Gin 


Leu 


1 








5 








10 








15 


Lys 


Pro 


Val 


Cys 


Val 


Ser 


Asp 


Ser Leu 


Lys 


Lys 


Gly 


Thr Lys 


Phe 










20 








25 








30 


Val 


Lys 


Trp 


Asp 


Asp 


Asp 


Ser 


Thr He 


Val 


Thr 


Pro 


He He 


Leu 










35 








40 








45 


Arg 


Thr 


Asp 


Pro 


Gin 


Gly 


Phe 


Phe Phe 


Tyr 


Trp 


Thr 


Asp Gin 


Asn 










50 








55 








60 


Lys 


Glu 


Thr 


Glu 


Leu 


Leu 


Asp 


Leu Ser 


Leu 


Val 


Lys 


Asp Ala 


Arg 










65 








70 








75 


Cys 


Gly 


Arg 


His 


Ala 


Lys 


Ala 


Pro Lys 


Asp 


Pro 


Lys 


Leu Arg 


Glu 










80 








8 ? 








90 


Leu 


Leu 


Asp 


Val 


Gly 


Asn 


lie 


Gly Arg 


Leu 


Glu 


Gin 


Arg Met 


He 








95 








100 








105 


Thr 


Val 


Val 


Tyr 


Gly 


Pro 


Asp 


Leu Val 


Asn 


He 


Ser 


His Leu 


Asn 








110 








115 








120 


Leu 


Val 


Ala 


Phe 


Gin 


Glu 


Glu 


Val Ala 


Lys 


Glu 


Trp 


Thr Asn 


Glu 










125 








130 






135 


Val 


Phe 


Ser 


Leu 


Ala 


Thr 


Asn 


Leu Leu 


Ala 


Gin 


Asn 


Met Ser 


Arg 










140 








145 








150 


Asp 


Ala 


Phe 


Leu 


Glu 


Lys 


Ala 


Tyr Thr 


Lys 


Leu 


Lys 


Leu Gin 


val 








155 








160 






165 


Thr 


Pro 


Glu 


Gly 


Arg 


He 


Pro 


Leu Lys 


Asn 


He 


Tyr 


Arg Leu 


Phe 
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170 

Ser Ala Asp Arg Lys 
185 

Leu Pro Ser Ser Arg 
200 

Glu Gin Ser Cys Gly 
215 

<210> 235 
<211> 185 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: LG : 338927 . 1 .orf 3 :2000MAY19 

<400> 235 



Leu 


Arg 


Ser 


Thr Pro 


Glu Thr Gly Arg 


Met 


Lys 


Gly Ala Ser 


Glu 


1 






5 




10 






15 


Glu 


Lys 


Leu 


Ala Ser 


Val Ser Asn Leu 


Val 


Thr 


Val Phe Glu 


Asn 








20 




25 






30 


Ser 


Arg 


Thr 


Pro Glu 


Ala Ala Pro Arg 


Gly 


Gin 


Arg Leu Glu 


Asp 








35 




40 






45 


Val 


His 


His 


Arg Pro 


Glu Cys Arg Pro 


Pro 


Glu 


Ser Pro Gly 


Pro 








50 


55 




60 


Arg 


Glu 


Lys 


Thr Asn 


Val Gly Glu Ala 


Val 


Gly 


Ser Glu Pro 


Arg 








65 




70 






75 


Thr 


Val 


Ser 


Arg Arg 


Tyr Leu Asn Ser 


Leu 


Lys 


Asn Lys Leu 


Ser 








80 




85 






90 


Ser 


Glu 


Ala 


Trp Arg 


Lys Ser Cys Gin 


Pro 


Val 


Thr Leu Ser 


Gly 








95 




100 






105 


Ser 


Gly 


Thr 


Gin Glu 


Pro Glu Lys Lys 


He 


Val 


Gin Glu Leu 


Leu 








110 


115 






120 


Glu 


Thr 


Glu 


Gin Ala 


Tyr Val Ala Arg 


Leu 


His 


Leu Leu Asp 


Gin 








125 




130 






135 


Val 


Phe 


Phe 


Gin Glu 


Leu Leu Lys Thr 


Ala 


Arg 


Ser Ser Lys 


Ala 








140 


145 






150 


Phe 


Pro 


Glu 


Asp Val 


Val Arg Val lie 


Phe 


Ser 


Asn He Ser 


Ser 








155 


160 






165 


He 


Tyr 


Gin 


Phe His 


Ser Gin Phe Phe 


Leu 


Pro 


Glu Leu Gin 


Arg 






170 




175 






180 


Arg 


Leu 


Asp 


Asp Trp 

1 Q C 
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<223> Incyte ID No: LG: 898771.1. orf 2 :200.0MAY19 
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Arg Pro Leu 


Glu 


His 


Gly 


Thr His Arg 


His 


He 


Ala Ser Leu 


Lys 


1 




5 






10 






15 


Thr Glu Glu 


Thr 


Arg 


Arg 


Ala Arg Pro 


Ala 


Ala 


Ala Gin Ala 


Val 






20 






25 






30 


Tyr Leu Pro 


Val 


Ser 


Gin 


His Gly His 


Gin 


Asp 


Pro Val His 


Phe 






35 






40 




45 


Ala Leu Ser 


Gin 


Arg 


Arg 


Gly Pro Ser 


Leu 


Pro 


Ala Ala Ala 


Thr 






50 






55 






60 


Val Pro Pro 


Asp 


Leu 


Pro 


Ser Glu Asp 


Pro 


His 


Pro Gly Ala 


Gly 






65 






70 




75 


Pro Pro Glu 


His 


Gly 


Gin 


Pro Arg Pro 


Leu 


Pro 


Asp Gly His 


His 






80 






85 




90 



175 180 
Arg Val Glu Thr Ala Leu Glu Ala Cys Ser 
• 190 195 
Val Glu Lys Ala Asn Glu Ala Ala Lys Ser 
205 210 
Lys Ala Pro Pro Lys His Phe 
220 
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Gin Cys 


Pro Gin 


Leu 


Leu 


Pro 


Ser 


Gin 


Ser 


Thr 


Arg Cys Arg Leu 




95 










100 




105 


Leu Gin 


Leu Pro 


Leu 


Cys 


Ala 


Glu 


Arg 


Asp 


Leu 


Gly Pro Ala Ala 






110 










115 




120 


Gly Ser 


Arg Val 


Cys 


Ser 


Lys 


Gly 


Arg 


Val 


Gly 


Ala Ala Gly Arg 






125 










130 




135 


His Val 


Trp Arg 


Arg 


Gin 


Pro 


Gin 


Gly 


Leu 


Ser 


Pro Pro Gly Ala 






140 










145 




150 


Val Val 


His Leu 


Val 


Thr 


Gin 


Asp 


Arg 


Ala 


lie 


Val Thr Arg Arg 






155 








160 




165 


Gly Arg 


His Arg 


Gin 


Pro 


Arg 


Ala 


Cys 


Gly 


Arg 


Val Leu Glu Val 






170 










175 




180 


Val Ser 


Ala His 


Arg 


Glu 


Trp 


Ser 


Arg 


Ser 


Trp 


Arg 
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<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 257664 . 67 . orf 3 :2000MAY01 
<400> 237 

Ala His Ser Lys Pro Glu Lys lie Val Asn Lys Pro Asn Lys His 
15 10 15 

Glu Asp His Thr Gly Lys Leu Arg Pro Glu Thr Arg Glu Glu Asn 

20 25 30 

Lys Asn His Leu Lys Asp His Gin Pro Tyr Trp His Thr Phe Val 

35 40 45 

Asn Asn Thr Gin Phe Pro Asp lie Trp Glu Gin Val Lys Cys Val 

50 55 60 

Thr 



<210> 238 
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<212> PRT" 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :001496 .2 .orf 2:2000MAY01 
<400> 238 



Arg 


Cys 


Gly 


Ala 


Ala 


Ala 


Ser Ala Gly 


Arg 


Glu 


Ser Ala Ala Gly 


1 








5 






10 




15 


Ser 


Glu 


Glu 


Gin 


Ala 


Gly 


Leu Arg Pro 


Ser 


Gin 


Leu Arg Gly Pro 










20 




25 




30 


Pro 


Asp 


Pro 


Pro 


Thr 


Glu 


Thr Ala Ala 


Val 


Ser 


Gly Gin Ala Val 










35 






40 




45 


Gly 


Ala 


Ala 


Trp 


Pro 


Ala 


Ala Gly Lys 


Met 


Phe 


Ser Val Glu Ser 






50 




55 




60 


Leu 


Glu 


Arg 


Ala 


Glu 


Leu 


Cys Glu Ser 


Leu 


Leu 


Thr Trp He Gin 










65 




70 




75 


Thr 


Phe 


Asn 


Val 


Gly 


Cys 


Thr .Met Pro 


Glu 


Pro 


Val Glu Asp Leu 










80 






85 




90 


Thr 


Asn 


Gly 


Val 


Val 


Met 


Ala Gin Val 


Leu 


Gin 


Lys He Asp Pro 








95 






100 




105 


Ala 


Tyr 


Phe 


Asp 


Glu 


Asn 


Trp Leu Asn 


Arg 


He 


Lys Thr Glu Val 










110 






115 




120 


Gly 


Asp 


Asn 


Trp 


Arg 


Leu 


Lys He Ser 


Asn 


Leu 


Lys Lys He Leu 








125 






130 




135 


Lys 


Gly 


He 


Leu 


Asp 


Tyr 


Asn His Glu 


He 


Leu 


Gly Gin Gin He 








140 




145 




150 


Asn 


Asp 


Phe 


Thr 


Leu 


Pro 


Asp Val Asn 


Leu 


He 


Gly Glu His Ser 
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155 










160 




165 


Asp 


Ala 


Ala Glu 


Leu 


Gly 


Arg 


Met 


Leu 


Gin 


Leu 


He Leu Gly Cys 






170 






175 




180 


Ala 


Val 


Asn Cys 


Glu 


Gin 


Lys 


Gin 


Glu 


Tyr 


He 


Gin Ala He Met 






185 








190 




195 


Met 


Met 


Glu Glu 


Ser 
200 


Val 


Gin 


His 


Val 


Val 
205 


Met 


Thr Ala He Gin 
210 


Glu 


Leu 


Met Ser 


Lys 
215 


Glu 


Ser 


Pro 


Val 


Ser 
220 


Ala 


Gly Asn Asp Ala 
225 


Tvr 


Val 


Asp Leu 


Asp 


Arg 


Gin 


Leu 


Lys 


Lys 


Thr 


Thr Glu Glu Leu 






230 










235 




240 


Asn 


Glu 


Ala Leu 


Ser 


Ala 


Lys 


GlU 


Glu 


He 


Ala 


Gin Arg Cys His 








245 








250 




255 


Glu 


Leu 


Asp Met 


Gin 
260 


Val 


Ala 


Ala 


Leu 


Gin 
265 


Glu 


Glu Lys Ser Ser 
270 


Leu 


Leu 


Ala Glu 


Asn 
275 


Gin 


Val 


Leu 


Met 


Glu 
280 


Arg 


Leu Asn Gin Ser 
285 


Asp 


Ser 


He Glu 


Asp 


Pro 


Asn 


Ser 


Pro 


Ala 


Gly 


Arg Arg His Leu 






290 










295 




300 


Gin 


Leu 


Gin Thr 


Gin 
305 


Leu 


Glu 


Gin 


Leu 


Gin 
310 


Glu 


Glu Thr Phe Arg 
315 


Leu 


Glu 


Ala Ala 


Lys 
320 


Asp 


Asp 


Tyr 


Arg 


He 
325 


Arg 


Cys Glu Glu Leu 
330 


GlU 


Lys 


Gly Asp 


Leu 
335 
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<221> mis cofeature 

<223> Incyte ID No: LI:1085273 .2 .orf 1 :2000MAY01 
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Ala 


Arg 


Ala 


Gly 


Ser 


Pro 


Pro 


Arg 


Pro 


Pro 


Arg 


Pro Arg Arg Pro 


1 




5 










10 




15 


Ala 


His 


Cys 


Ser 


Arg 


Ala 


Cys 


Ala 


Ala 


Cys 


Thr 


Ser Pro Arg Thr 








20 










25 




30 


Ala 


Cys 


Arg 


Thr 


Leu 


Thr 


Ala 


Ser 


Ser 


Ala 


Pro 


Ser Pro Trp Thr 






35 










40 




45 


Ser 


Ser 


Leu 


Pro 


Thr 
50 


Pro 


Leu 


Ala 


Gly 


Gly 
55 


Pro 


Thr Ala Pro Gly 
60 


Pro 


Pro 


Thr 


Pro 


Ala 


Arg 


Pro 


Arg 


Ser 


Ser 


Ala 


Ser Trp Thr Ala 










65 








70 




75 


Arg 


Arg 


Gly 


Pro 


Arg 


Trp 


Ala 


Cys 


Pro 


Arg 


Pro 


Ala Arg Thr Ala 






80 










85 




90 


Arg 


Thr 


Pro 


Arg 


Leu 
95 


Arg 


Arg 


Gly 


Pro 


Arg 
100 


Pro 


Arg Arg Arg Pro 
105 


Arg 


Pro 


Pro 


Ala 


Gly 


Ser 


Pro 


Ala 


Arg 


Ser 


Pro 


Ala His Ser Leu 








110 










115 




120 


Gly 


Leu 


Asn 


Phe 


Gly 
125 


Asp 


Ala 


Ala 


Arg 


Gin 
130 


Thr 


Pro Arg His Gly 
135 


Leu 


Ser 


Ala 


Leu 


Ser 
140 


Ala 


Pro 


Gly 


Leu 


Pro 
145 


Gly 


Pro Gly Gin Pro 
150 


Ala 


Gly 


Pro 


Gly 


Ala 


Trp 


Ala 


Pro 


Pro 


Leu 


Asp 


Ser Pro Gly Thr 








155 










160 




165 


Pro 


Ser 


Pro 


Asp 


Gly 


Pro 


Trp 


Cys 


Phe 


Ser 


Pro 


Glu Gly Ala Gin 








170 










175 




180 


Gly 


Ala 


Gly 


Gly 


Val 


Leu 


Phe 


Ala 


Pro 


Phe 


Gly 


Arg Ala Gly Ala 




185 










190 




195 


Pro 


Gly 


Pro 


Gly 


Gly 
200 


Gly 


Ser 


Asp 


Leu 


Arg 
205 


Arg 


Arg Glu Ala Ala 
210 


Arg 


Ala 


Glu 


Pro 


Arg 


Asp 


Ala 


Arg 


Thr 


Gly 


Trp 


Pro Glu Glu Pro 








215 










220 




225 


Ala 


Pro 


GlU 


Thr 


Gin 


Phe 


Lys 


Arg 


Arg 


Ser 


Cys 


Gin Met Glu Phe 
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230 








235 






240 


GlU 


Glu 


Gly Met 


Val 


Glu 


Gly 


Arg Ala 


Arg 


Gly 


Glu 


Glu Leu Ala 








245 








250 






255 


Ala 


Leu 


Gly Lys 


Gin 


Ala 


Ser 


Phe Ser 


Gly 


Asn 


Val 


Glu Val He 






260 








265 






270 


Gin 


Val 


Val Ser 


Asp 


Pro 


Ser 


Ala Ala 


Phe 


Gly 


Pro 


Ala Ala Arg 








275 








280 






285 


Ser 


Gin 


Ala Arg 


Asn 


Lys 


Cys 


He Leu 


Tyr 


He 


Met 


Gin Arg Lys 








290 






295 






300 


Val 


Asn 


Gly Leu 


Thr 


Gly 


Asn 


Phe Asn 


Pro 


Arg 


Ser 


Lys He He 






305 






310 






315 


Ser 


lie 


Phe Tyr 


Leu 


Phe 


Lys 


Leu Phe 


He 


Leu 


Ala 


Met Asp Leu 






320 






325 






330 


Ala 


Thr 


Val Arg 


Val 


Val 


Leu 


Glu Leu 


Pro 


Phe 


Leu 


Leu Ser Gly 






335 








340 






345 


He 
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Ala 


Thr 


Trp 


Ala 


Phe 


He 


Ser 


Ala Pro 


Val 


Pro 


Val 


Phe Pro Asp 


1 






5 








10 






15 


Ser 


Phe 


Gly 


He 


Lys 


Ala 


Ser 


Ser Glu 


Ala 


Ser 


Thr 


Leu Glu Ala 










20 








25 






30 


Met 


Gly 


Arg 


Lys 


Glu 


Glu 


Asp 


Asp Cys 


Ser 


Ser 


Trp 


Lys Lys Gin 










35 








40 






45 


Thr 


Thr 


Asn 


He 


Arg 


Lys 


Thr 


Phe He 


Phe 


Met 


Glu 


Val Leu Gly 










50 






55 






60 


Ser 


Gly 


Ala 


Phe 


Ser 


Glu 


Val 


Phe Leu 


Val 


Lys 


Gin 


Arg Leu Thr 










65 








70 






75 


Gly 


Lys 


Leu 


Phe 


Ala 


Leu 


Lys 


Cys He 


Lys 


Lys 


Ser 


Pro Ala Phe 










80 








85 






90 


Arg 


Asp 


Ser 


Ser 


Leu 


Glu 


Asn 


Glu He 


Ala 


Val 


Leu 


Lys Lys He 








95 








100 






105 


Lys 


His 


Glu 


Asn 


He 


Val 


Thr 


Leu Glu 


Asp 


He 


Tyr 


Glu Ser Thr 








110 








115 






120 


Thr 


His 


Tyr 


Tyr 


Leu 


Val 


Met 


Gin Leu 


Val 


Ser 


Gly 


Gly Glu Leu 






125 








130 






135 


Phe 


Asp 


Arg 


He 


Leu 


Glu 


Arg 


Gly Val 


Tyr 


Thr 


Glu 


Lys Asp Ala 










140 








145 






150 


Ser 


Leu 


Val 


He 


Gin 


Gin 


Val 


Leu Ser 


Ala 


Val 


Lys 


Tyr Leu His 










155 








160 






165 


Glu 


Asn 


Gly 


He 


Val 


His 


Arg 


Asp Leu 


Lys 


Pro 


Glu 


Asn Leu Leu 










170 








175 






180 


Tyr 


Leu 


Thr 


Pro 


Glu 


Glu 


Asn 


Ser Lys 


He 


Met 


He 


Thr Asp Phe 








185 






190 






195 


Gly 


Leu 


Ser 


Lys 


Met 


Glu 


Gin 


Asn Gly 


He 


Met 


Ser 


Thr Ala Cys 










200 








205 






210 


Gly 


Thr 


Pro 


Gly 


Tyr 


Val 


Ala 


Pro Glu 


Val 


Leu 


Ala 


Gin Lys Pro 










215 








220 






225 


Tyr 


Ser 


Lys 


Ala 


Val 


Asp 


Cys 


Trp Ser 


He 


Gly 


Val 


He Thr Tyr 










230 








235 






240 


He 


Leu 


Leu 


Cys 


Gly 


Tyr 


Pro 


Pro Phe 


Tyr 


Glu 


Glu 


Thr Glu Ser 










245 








250 






255 


Lys 


Leu 


Phe 


Glu 


Lys 


He 


Lys 


Glu Gly 


Tyr 


Tyr 


Glu 


Phe Glu Ser 










260 






265 






270 


Pro 


Phe 


Trp 


Asp 


Asp 


He 


Ser 


Glu Ser 


Ala 


Lys 


Asp 


Phe He Cys 










275 








280 




285 


His 


Leu 


Leu 


GlU 


Lys 


Asp 


Pro 


Asn Glu 


Gly 


Val 


Thr 


Leu 



122/228 



WO 01/62927 



290 295 

<210> 241 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 338927 . l.orfl :2000MAY01 
<400> 241 



Asp 


Pro 


Pro 


Arg Glu 


Thr 


Gly 


Arg Met 


Lys 


Gly 


Ala Ser Glu Glu 


1 






5 








10 




15 


Lys 


Leu 


Ala 


Ser Val 


Ser 


Asn 


Leu Val 


Thr 


Val 


Phe Glu Asn Ser 








20 








25 




30 


Arg 


Thr 


Pro 


Glu Ala 


Ala 


Pro 


Arg Gly 


Gin 


Arg 


Leu Glu Asp Val 








35 








40 




45 


His 


His 


Arg 


Pro Glu 


Cys 


Arg 


Pro Pro 


Glu 


Ser 


Pro Gly Pro Arg 








50 








55 




60 


Glu 


Lys 


Thr 


Asn Val' 


Gly 


Glu 


Ala Val 


Gly 


Ser 


Glu Pro Arg Thr 








65 








70 




75 


Val 


Ser 


Arg 


Arg Tyr 


Leu 


Asn 


Ser Leu 


Lys 


Asn 


Lys Leu Ser Ser 








80 








85 




90 


Glu 


Ala 


Trp 


Arg Lys 


Ser 


Leu 


Pro Ala 


Cys 


Asp 


Pro Leu Arg lie 








95 








100 




105 


Gly 


Asp 


Ala 


Gly Ala 


Arg 


Glu 


Glu Asp 


Arg 


Pro 


Gly Ala Ala Trp 








110 








115 




120 


Arg 


His 


Asp 


Ala Gly 


His 


Met 


Trp Arg 


Ala 


Ser 


Thr Cys 



125 130 

<210> 242 
<211> 354 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:335558 . 1 .orf 2 :2000FEB18 
<220> 

<221> unsure 
<222> 341 

<223> unknown or other 
<400> 242 



Leu 


He 


Gly 


Val 


Leu 


Gin 


Val 


Leu Gin 


Val 


Glu 


Leu Gly He Asn 


1 








5 








10 




15 


Ser 


Val 


Thr 


Gly 


Thr 


Ser 


Thr 


Val Asn 


Asn 


Val 


Asn He Thr Ala 










20 








25 




30 


Val 


Gly 


Ser 


Phe 


Asn 


Pro 


Asn 


Val Thr 


Ser 


Ser 


Met Leu Gly Asn 










35 








40 




45 


Val 


Asn 


He 


Ser 


Thr 


Ser 


Asn 


He Pro 


Ser 


Ala 


Ala Gly Val Ser 










50 








55 




60 


Val 


Gly 


Pro 


Gly 


Val 


Thr 


Ser 


Gly Val 


Asn 


Val 


Asn He Leu Ser 










65 






70 




75 


Gly 


Met 


Gly 


Asn 


Gly 


Thr 


He 


Ser Ser 


Ser 


Ala 


Ala Val Ser Ser 










80 








85 




90 


Val 


Pro 


Asn 


Ala 


Ala 


Ala 


Gly 


Met Thr 


Gly 


Gly 


Ser Val Ser Ser 










95 








100 




105 


Gin 


Gin 


Gin 


Gin 


Pro 


Thr 


Val 


Asn Thr 


Ser 


Arg 


Phe Arg Val Val 










110 








115 


120 


Lys 


Leu 


Asp 


Ser 


Ser 


Ser 


Glu 


Pro Phe 


Lys 


Lys 


Gly Arg Trp Thr 










125 








130 




135 


Cys 


Thr 


Glu 


Phe 


Tyr 


Glu 


Lys 


Glu Asn 


Ala 


Val 


Pro Ala Thr Glu 










140 






145 




150 


Gly 


Val 


Leu 


He 


Asn 


Lys 


Val 


Val Glu 


Thr 


Val 


Lys Gin Asn Pro 
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155 








160 




165 


lie 


Glu 


Val 


Thr 


Ser 


Glu 


Arg 


Glu Ser 


Thr 


Ser Gly 


Ser Ser Val 










170 






175 




180 


Ser 


Ser 


Ser Val 


Ser 


Thr 


Leu 


Ser His 


Tyr 


Thr Glu 


Ser Val Gly 










185 








190 




195 


Ser 


Gly 


Glu Met 


Gly 


Ala 


Pro 


Thr Val 


Val 


Val Gin 


Gin Gin Gin 










200 








205 




210 


Gin 


Gin 


Gin Arg 


Leu 


Leu 


Gin 


Gin Gin 


Pro 


Ala Leu 


Gin Gly Val 










215 








220 




225 


Thr 


Leu 


Gin Gin 


Met 


Asp 


Phe 


Gly Ser 


Thr 


Gly Pro 


Gin Ser He 










230 






235 




240 


Pro 


Ala 


Val 


Ser 


He 


Pro 


Gin 


Ser He 


Ser 


Gin Ser 


Gin He Ser 










245 








250 




255 


Gin 


Val 


Gin 


Leu 


Gin 


Ser 


Gin 


Glu Leu 


Ser 


Tyr Gin 


Gin Lys Gin 










260 








265 




270 


Gly 


Leu 


Gin 


Pro 


Val 


Pro 


Leu 


Gin Ala 


Thr 


Met Ser 


Ala Ala Thr 








275 








280 




285 


Gly 


lie 


Gin 


Pro 


Ser 


Pro 


Val 


Asn Val 


Val 


Gly Val 


Thr Ser Ala 










290 








295 


300 


Leu 


Gly 


Gin 


Gin 


Pro 


Ser 


He 


Ser Ser 


Leu 


Ala Gin 


Pro Gin Leu 








305 








310 




315 


Pro 


Tyr 


Ser 


Gin 


Ala 


Ala 


Pro 


Pro Val 


Gin 


Thr Pro 


Leu Pro Gly 










320 








325 




330 


Ala 


Pro 


Pro 


Pro 


Gin 


Gin 


Leu 


Gin Tyr 


Glv 


Xaa Gin 


Gin Pro Met 










335 






340 




345 


Val 


Ser 


Thr 


Gin 


Met 


Ala 


Pro 


Gly Met 









350 

<210> 243 
<211> 237 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG.-998283 . 7 .orf 1;2000PEB18 
<400> 243 



Leu 


Tyr 


Cys 


He 


Cys 


Lys 


Thr 


Pro Tyr 


Asp 


Glu 


Ser 


Lys Phe Tyr 


1 








5 








10 






15 


He 


Gly 


Cys 


Asp 


Leu 
20 


Cys 


Thr 


Asn Trp 


Tyr 
25 


His 


Gly 


Glu cys Val 
30 


Gly 


He 


Thr 


Glu 


Lys 
35 


Glu 


Ala 


Lys Lys 


Met 
40 


Asp 


Val 


Tyr He Cys 
45 


Asn 


Asp 


Cys 


Lys 


Arg 
50 


Ala 


Gin 


Glu Gly 


Ser 
55 


Ser 


Glu 


Glu Leu Tyr 
60 


Cys 


He 


Cys 


Arg 


Thr 
65 


Pro 


Tyr 


Asp GlU 


Ser 
70 


Gin 


Phe 


Tyr He Gly 
75 


Cys 


Asp 


Arg 


Cys 


Gin 
80 


Asn 


Trp 


Tyr His 


Gly 
85 


Arg 


Cys 


Val Gly He 
90 


Leu 


Gin 


Ser 


Glu 


Ala 
95 


Glu 


Leu 


He Asp 


Glu 
100 


Tyr 


Val 


Cys Pro Gin 
105 


Cys 


Gin 


Ser 


Thr 


Glu 
110 


Asp 


Ala 


Met Thr 


Val 
115 


Leu 


Thr 


Pro Leu Thr 
120 


Glu 


Lys 


Asp 


Tyr 


Glu 
125 


Gly 


Leu 


Lys Arg 


Val 
130 


Leu 


Arg 


Ser Leu Gin 
135 


Ala 


His 


Lys 


Met 


Ala 
140 


Trp 


Pro 


Phe Leu 


Glu 
145 


Pro 


Val 


Asp Pro Asn 
150 


Asp 


Ala 


Pro 


Asp 


Tyr 
155 


Tyr 


Gly 


Val He 


Lys 
160 


Glu 


Pro 


Met Asp Leu 
165 


Ala 


Thr 


Met 


GlU 


Glu 
170 


Arg 


Val 


Gin Arg 


Arg 
175 


Tyr 


Tyr 


Glu Lys Leu 
180 


Thr 


Glu 


Phe 


Val 


Ala 
185 


Asp 


Met 


Thr Lys 


He 
190 


Phe 


Asp 


Asn Cys Arg 
195 


Tyr 


Tyr 


Asn 


Pro 


Ser 
200 


Asp 


Ser 


Pro Phe 


Tyr 
205 


Gin 


Cys 


Ala Glu Val 
210 


Leu 


Glu 


Ser 


Phe 


Phe 


Val 


Gin 


Lys Leu 


Lys 


Gly 


Phe 


Lys Ala Ser 
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215 220 225 

Arg Ser His Asn Asn Lys Leu Gin Ser Thr Ala Ser 
230 235 

<210> 244 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI:402739 . 1 .orf 1:2000FEB01 
<400> 244 



Pro 


Pro 


Trp 


Gly Gin 


Arg 


Ser Pro Thr Pro 


Pro 


Ser Asp Thr Gly 


1 






5 




10 




15 


Gly 


Thr 


Ser 


Arg Pro 


Arg 


Thr Met He Pro 


Pro 


Gly Glu Cys Thr 








20 




25 




30 


Tyr 


Ala 


Gly 


Arg Lys 


Arg 


Arg Arg Pro Leu 


Gin 


Lys Gin Arg Pro 








35 




40 




45 


Ala 


Val 


Gly 


Ala Glu 


Lys 


Ser Asn Pro Ser 


Lys 


Arg His Arg Asp 








50 




55 




60 


Arg 


Leu 


Asn 


Ala Glu 


Leu 


Asp His Leu Ala 


Ser 


Leu Leu Pro Phe 






65 




70 




75 


Pro 


Pro 


Asp 


He He 


Ser 


Lys Leu Asp Lys 


Leu 


Ser Val Leu Arg 








80 




85 




90 


Leu 


Ser 


Val 


Ser Tyr 


Leu 


Arg Val Lys Ser 


Phe 


Phe Gin Gly Gin 








95 




100 




105 


Gly 


Leu 


Ala 


Val Ala 


Asp 


Ala Glu Asp Val 


Asp 


Asp His Thr Gly 








110 




115 




120 


Glu 


Arg 


Arg 


Pro Met 


Ser 


Phe Arg Arg Pro 


Arg 


Ala Leu Asp Thr 








125 




130 




135 


Gin 


Ala 


Leu 


Arg Arg 


Thr 


Gin Phe Gly Leu 


His 


Leu Leu Met Val 








140 




145 




150 


Asn 


He 


Ala 


Gly Leu 


He 


Ala Thr Asp Arg 


Leu 










155 




160 







<210> 245 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 175223 . 1 .orf 3 : 2000FEB01 
<400> 245 



He 


Asp 


Ala 


Glu 


Asp 


His 


Ser 


Val Pro 


Lys 


Gly 


Lys Phe Ser Ser 


1 








5 








10 




15 


His 


Glu 


Phe 


Gly 


Ala 


Glu 


Gly 


Pro Trp 


Gly 


Asn 


Met Ala Glu Gly 










20 








25 




30 


Gly 


Ala 


Ser 


Lys 


Gly 


Gly 


Gly 


Glu Glu 


Pro 


Gly 


Lys Leu Pro Glu 










35 








40 




45 


Pro 


Ala 


Glu 


Glu 


Glu 


Ser 


Gin 


Val Leu 


Arg 


Gly 


Thr Gly His Cys 










50 








55 




60 


Lys 


Trp 


Phe 


Asn 


Val 


Arg 


Met 


Gly Phe 


Gly 


Phe 


He Ser Met He 










65 








70 




75 


Asn 


Arg 


Glu 


Gly 


Ser 


Pro 


Leu 


Asp He 


Pro 


Val 


Asp Val Phe Val 










'80 








85 




90 


His 


Gin 


Ser 


Lys 


Leu 


Phe 


Met 


Glu Gly 


Phe 


Arg 


Ser Leu Lys Glu 








95 






100 




105 


Gly 


Glu 


Pro 


Val 


Glu 


Phe 


Thr 


Phe Lys 


Lys 


Ser 


Ser Lys Gly Leu 










110 








115 




120 


Glu 


Ser 


lie 


Arg 


Val 


Thr 


Gly 


Pro Gly 


Gly 


Ser 


Pro Cys Leu Gly 










125 








130 




135 


Ser 


Glu 


Arg 


Arg 


Pro 


Lys 


Gly 


Lys Thr 


Leu 


Gin 


Lys Arg Lys pro 










140 








145 




150 
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PCT/DS01/06059 



Lys 



<210> 246 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 981076 . 2 .orf 2 :2000MAY19 
<220> 

<221> unsure 
<222> 157 

<223> unknown or other 
<400> 246 



Met 


Ala 


Ser Lys 


Val 


Thr 


Asp 


Ala He 


Val 


Trp 


Tyr Gin Lys 


Lys 


1 






5 








10 






15 


He 


Gly 


Ala Tyr 


Asp 


Gin 


Gin 


He Trp 


Glu 


Lys 


Ser Val Glu 


Gin 








20 








25 






30 


Arg 


Glu 


He Lys 


Gly 


Leu 


Arg 


Asn Lys 


Pro 


Lys 


Lys Thr Ala 


His 








35 








40 






45 


val 


Lys 


Pro Asp 


Leu 


He 


Asp 


Val Asp 


Leu 


Val 


Arg Gly Ser 


Ala 








50 








55 






60 


Phe 


Ala 


Lys Ala 


Lys 


Pro 


Glu 


Ser Pro 


Trp 


Thr 


Ser Leu Thr 


Arg 








65 








70 






75 


Lys 


Gly 


He Val 


Arg 


Val 


Val 


Phe Phe 


Pro 


Phe 


Phe Phe Arg 


Trp 








80 








85 






90 


Trp 


Leu 


Gin Val 


Thr 


Ser 


Lys 


Val He 


Phe 


Phe 


Trp Leu Leu 


Val 








95 








100 






105 


Leu 


Tyr 


Leu Leu 


Gin 


Val 


Ala 


Val He 


Val 


Leu 


Phe Cys Ser 


Thr 








110 








115 






120 


Ser 


Ser 


Pro His 


Ser 


He 


Pro 


Leu Thr 


Glu 


Val 


He Gly Pro 


He 








125 








130 






135 


Trp 


Leu 


Met Leu 


Leu 


Leu 


Gly 


Thr Val 


His 


Cys 


Gin He Val 


Ser 








140 








145 




150 


Thr 


Arg 


Thr Pro 


Lys 


Pro 


Xaa 


Leu Ser 


Thr 









155 160 

<210> 247 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :1008973 . 1 .orf3 :2000MAY01 
<400> 247 



Leu 


His 


He 


Pro 


Arg 


Ser 


Pro 


Pro 


Gly 


Asp 


Arg 


Ala Ala Arg 


Thr 


1 








5 










10 






15 


Gly 


His 


Pro 


Arg 


Leu 
20 


Pro 


Val 


Pro 


Pro 


Pro 
25 


Arg 


Ala Arg Thr 


Glu 
30 


Pro 


Arg 


Pro 


Arg 


Gly 
35 


Gin 


Arg 


Arg 


Leu 


His 
40 


Ser 


Ser Gly Glu 


Met 
45 


Ala 


Ala 


Gly 


Ser 


Thr 


Thr 


Leu 


Arg 


Ala 


Val 


Gly 


Lys Leu Gin 


Val 










50 










55 


60 


Arg 


Leu 


Ala 


Thr 


Lys 
65 


Thr 


Glu 


Pro 


Lys 


Lys 
70 


Leu 


Glu Lys Tyr 


Leu 
75 


Gin 


Lys 


Leu 


Ser 


Ala 
80 


Leu 


Pro 


Met 


Thr 


Ala 
85 


Asp 


lie Leu Ala 


Glu 
90 


Thr 


Gly 


Leu 


Arg 


Lys 
95 


Thr 


Val 


Lys 


Arg 


Leu 
100 


Arg 


Lys His Gin 


His 
105 


Val 


Gly 


Asp 


Phe 


Ala 
110 


Arg 


Asp 


Leu 


Ala 


Ala 
115 


Arg 


Trp Lys Lys 


Leu 
120 
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Val Leu Val Asp Arg Asn Thr Gly Pro Asp Pro Gin Asp Pro Glu 
125 130 135 

Glu Ser Ala Ser Arg Gin Arg Tyr Gly Glu Ala Leu Gin Glu Arg 
140 145 150 

Glu Lys Gly Trp Gly Leu Pro Arg Lys Arg 
155 160 

<210> 248 
<211> 171 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:1190250 . 1 .orf 1 :2000MAY01 
<400> 248 



Arg 


Ala 


His Ser 


Gly 


Val 


Leu 


Met 


Ser 


Ala 


Met 


Leu Ser His Gly 


1 






5 










10 




15 


Val 


Leu 


Lys Arg 


Ala 


Ser 


Glu 


Arg 


Gly 


Ala 


Glu 


Arg His Ser Leu 








20 










25 




30 


Pro 


Pro 


Ser Arg 


Leu 


Val 


Leu 


Val 


Pro 


Gly 


Arg 


Arg Ala Leu Arg 








35 










40 




45 


Ser 


Ala 


Pro Gin 


Val 


Pro 


Gly 


Ser 


Gly 


Trr> 


Arg 


Val Gly Thr Glu 








50 








55 




60 


Pro 


Pro 


Val Leu 


His 


Asp 


Pro 


Ala 


Gly 


Arg 


Gly 


Arg Phe Pro Gin 








65 










70 




75 


Ser 


Gly 


Glu Val 


Ser 


Ala 


Ala 


Pro 


GlU 


Met 


Ser 


Lys Leu Ser Phe 








80 










85 




90 


Arg 


Ala 


Arg Ala 


Leu 


Asp 


Ala 


Ser 


Lys 


Pro 


Leu 


Pro Val Phe Arg 








95 










100 




105 


cys 


GlU 


Asp Leu 


Pro 


Asp 


Leu 


His 


Glu 


Tyr 


Ala 


Ser He Asn Arg 








110 










115 




120 


Ala 


Val 


Pro Gin 


Met 


Pro 


Thr 


Gly 


lie 


Glu 


Lys 


Glu Glu Glu Ser 








125 








130 




135 


Glu 


His 


His Leu 


Pro 


Ala 


Gly 


Leu 


Phe 


Gin 


His 


Ser Arg Cys Met 








140 










145 




150 


Ala 


Arg 


Arg Gly 


lie 


Tyr 


Met 


Val 


lie 


Pro 


Val 


Pro Glu Ala Glu 








155 










160 




165 


Ser 


Asn 


Tyr Cys 


Leu 


Leu 















170 

<210> 249 
<211> 449 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 021371. 3 . orf 2 :2000FEB18 
<400> 249 



Pro 


Gly 


Met 


Ser 


Val 


Ala Gly Val Glu 


Gly 


Glu Pro Leu Val Ser 


1 






5 


10 


15 


Ser 


Gin 


Ser 


Gly 


Gin 


Ser Pro Pro Glu 


Pro 


Gin Asp Pro Glu Ala 










20 




25 


30 


Pro 


Ser 


Ser 


Ser 


Gly 


Pro Gly His Leu 


Val 


Ala Met Gly Lys Val 










35 




40 


45 


Ser 


Arg 


Thr 


Pro 


Val 


Glu Ala Gly Val 


Ser 


Gin Ser Asp Ala Glu 










50 


55 


. 60 


Asn 


Ala 


Ala 


Pro 


Ser 


Cys Pro Asp Glu 


His 


Asp Thr Leu Pro Arg 










65 


70 


75 


Arg 


Arg 


Gly 


Arg 


Pro 


Ser Arg Arg Phe 


Leu 


Gly Lys Lys Tyr Arg 










80 




85 


90 


Lys 


Tyr 


Tyr 


Tyr 


Lys 


Ser Pro Lys Pro 


Leu 


Leu Arg Pro Phe Leu 










95 




100 


105 


Cys 


Arg 


He 


Cys 


Gly 


Ser Arg Phe Leu 


Ser 


His Glu Asp Leu Arg 



127/228 

i 



WO 01/62927 



PCT/US01/06059 



110 115 120 



Phe 


His 


Val 


Asn Ser 
125 


His 


Glu 


Ala 


Gly 


Asp 
130 


Pro 


Gin 


Leu Phe 


Lys 
135 


Cys 


Leu 


Gin 


Cys Ser 
140 


Tyr 


Arg 


Ser 


Arg 


Arg 
145 


Trp 


Ser 


Ser Leu 


Lys 
150 


GlU 


His 


Met 


Phe Asn 
155 


His 


Val 


Gly 


Ser 


Lys 
160 


Pro 


Tyr 


Lys Cys 


Asp 
165 


GlU 


Cys 


Ser 


Tyr Thr 
170 


Ser 


Val 


Tyr 


Arg 


Lys 
175 


Asp 


Val 


He Arg 


His 
180 


Ala 


Ala 


Val 


His Ser 
185 


Arg 


Asp 


Arg 


Lys 


Lys 
190 


Arg 


Pro 


Asp Pro 


Thr 
195 


Pro 


Lys 


Leu 


Ser Ser 
200 


Phe 


Pro 


Cys 


Pro 


Val 
205 


Cys 


Gly 


Arg Val 


Tyr 
210 


Pro 


Met 


Gin 


Lys Arg 
215 


Leu 


Thr 


Gin 


His 


Met 
220 


Lys 


Thr 


His Ser 


Thr 
225 


Glu 


Lys 


Pro 


His Met 
230 


Cys 


Asp 


Lys 


Cys 


Gly 
235 


Lys 


Ser 


Phe Lys 


Lys 
240 


Arg 


Tyr 


Thr 


Phe Lys 
245 


Met 


His 


Leu 


Leu 


Thr 
250 


His 


He 


Gin Ala 


Val 
255 


Ala 


Asn 


Arg 


Arg Phe 
260 


Lys 


Cys 


Glu 


Phe 


Cys 
265 


Glu 


Phe 


Val Cys 


Glu 
270 


Asp 


Lys 


Lys 


Ala Leu 
275 


Leu 


Asn 


His 


Gin 


Leu 
280 


Ser 


His 


Val Ser 


Asp 
285 


Lys 


Pro 


Phe 


Lys Cys 
290 


Ser 


Phe 


cys 


Pro 


Tyr 
295 


Arg 


Thr 


Phe Arg 


Glu 
300 


Asp 


Phe 


Leu 


Leu Ser 
3 05 


His 


Val 


Ala 


Val 


Lys 
310 


His 


Thr 


Gly Ala 


Lys 
315 


Pro 


Phe 


Ala 


Cys Glu 
320 


Tyr 


Cys 


His 


Phe 


Ser 
325 


Thr 


Arg 


His Lys 


Lys 
330 


Asn 


Leu 


Arg 


Leu His 

'335 


Val 


Arg 


Cys 


Arg 


His 
340 


Ala 


Ser 


Ser Phe 


Glu 
345 


Glu 


Trp 


Gly 


Arg Arg 
350 


His 


Pro 


Glu 


Glu 


Pro 
355 


Pro 


Ser 


Arg Arg 


Arg 
360 


Pro 


Phe 


Phe 


Ser Leu 


Gin 


Gin 


He 


Glu 


Glu 


Leu 


Lys 


Gin Gin 


His 








3 65 










370 






375 


Ser 


Ala 


Ala 


Pro Gly 
380 


Pro 


Pro 


Pro 


Ser 


Ser 
385 


Pro 


Gly 


Pro Pro 


Glu 
390 


He 


Pro 


Pro 


Glu Ala 
395 


Thr 


Thr 


Phe 


Gin 


Ser 
400 


Ser 


Glu 


Ala Pro 


Ser 
405 


Leu 


Leu 


Cys 


Ser Asp 
410 


Thr 


Leu 


Gly 


Gly 


Ala 
415 


Thr 


He 


He Tyr 


Gin 
420 


Gin 


Gly 


Ala 


Glu Glu 


Ser 


Thr 


Ala 


Met 


Ala 


Thr 


Gin 


Thr Ala 


Leu 






425 










430 








435 


Asp 


Leu 


Leu 


Leu Asn 


Met 


Ser 


Ala 


Gin 


Arg 


Glu 


Leu 


Gly Gly 





440 445 

<210> 250 
<211> 127 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 475404. l.orf 2 :2000FEB18 
<400> 250 



Leu Thr Pro 


Gly 


His 


Pro Gly Ser 


Arg Gly Met 


Asp 


Ser Val 


Ala 


1 




5 




10 






15 


Phe Glu Asp 


Val 


Ala 


Val Asn Phe 


Thr Gin Glu 


Glu 


Trp Ala 


Leu 






. 20 




25 




30 


Leu Asp Ser 


Ser 


Gin 


Lys Asn Leu 


Tyr Arg Glu 


Val 


Met Gin 


Glu 






35 


40 






45 


Thr Cys Arg 


Asn 


Leu 


Ala Ser Val 


Gly Ser Gin 


Trp 


Lys Asp 


Gin 






50 




55 




60 


Asn He Glu 


Asp 


His 


Phe Glu Lys 


Pro Gly Lys 


Asp 


He Arg 


Asn 






65 




70 






75 


His He Val 


Gin 


Arg 


Leu Cys Glu 


Ser Lys Glu 


Asp 


Gly Gin 


Tyr 



128/228 



WO 01/62927 
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80 85 90 

Gly Glu Val Val Ser Gin lie Pro Asn Leu Asp Leu Asn Glu Asn 
95 100 105 

He Ser Thr Gly Leu Lys Pro Cys Glu Cys Ser He Cys Gly Lys 
110 115 120 

Val Phe Val Arg His Ser Leu 
125 

<210> 251 
<211> 157 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 979406 . 2 .orfl :2000FEB18 



<220> 

<221> unsure 
<222> 136 

<223> unknown or other 



<400> 251 



Asn 


Ser Leu Ser 


Val 


Ala Ser Ala Pro 


Pro 


Gin 


Arg Asp Pro Gly 


1 




5 




10 




15 


Met 


Ala Met Ala 


Leu 


Pro Met Pro Gly 


Pro 


Gin 


Glu Ala Val Val 






20 


25 




30 


Phe 


Glu Asp Val 


Ala 


Val Tyr Phe Thr 


Arg 


He 


Glu Trp Ser Cys 






35 




40 




45 


Leu 


Ala Pro Asp 


Gin 


Gin Ala Leu Tyr 


Arg 


Asp 


Val Met Leu Glu 






50 


55 




60 


Asn 


Tyr Gly Asn 


Leu 


Ala Ser Leu Gly 


Phe 


Leu 


Val Ala Lys Pro 




65 


70 




75 


Ala 


Leu lie Ser 


Leu 


Leu Glu Gin Gly 


Glu 


Glu 


Pro Gly Ala Leu 






80 


85 




90 


He 


Leu Gin Val 


Ala 


Glu Gin Ser Val 


Ala 


Lys 


Ala Ser Leu Cys 






95 




100 




105 


Thr 


Glu Asp Pro 


Asn 


Thr Leu Pro Ser 


Arg 


Ser 


Gin Glu Gly Ser 




110 




115 




120 


Pro 


Ala Ser Ser 


Glu 


Gly Gly Pro Gly 


Glu 


Lys 


Gly Val Ala Gly 






125 


130 




135 


Xaa 


Val Ala Gly 


Gly 


Gly Ala Ala Ser 


Ser 


Trp 


Pro His Gly Glu 






140 




145 




150 


His 


Pro Val Thr 


Pro 


Asn Arg 












155 











<210> 252 
<211> 305 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 410726. l.orf 1:2000FEB18 
<400> 252 



Asp Thr 


Met 


Gin 


Ala 


Val Val Pro Leu 


Asn 


Lys 


Met Thr Ala He 


1 






5 




10 




15 


Ser Pro 


Glu 


Pro 


Gin 


Thr Leu Ala Ser 


Thr 


Glu 


Gin Asn Glu Val 








20 




25 




30 


Pro Arg 


Val 


Val 


Thr 


Ser Gly Glu Gin 


Glu 


Ala 


He Leu Arg Gly 








35 




40 




45 


Asn Ala 


Ala 


Asp 


Ala 


Glu Ser Phe Arg 


Gin 


Arg 


Phe Arg Trp Phe 








50 




55 




60 


Cys Tyr 


Ser 


GlU 


Val 


Ala Gly Pro Arg 


Lys 


Ala 


Leu Ser Gin Leu 








65 




70 




75 


Trp Glu 


Leu 


Cys 


Asn 


Gin Trp Leu Arg 


Pro 


Asp 


He His Thr Lys 
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60 65 90 



Glu 


Gin 


lie Leu 


Glu. Leu 


Leu 


Val Phe 


Glu 


Gin 


Phe 


Leu Thr He 








95 








100 






105 


Leu 


Pro 


Gly Glu 


lie 


Arg 


He 


Trp Val 


Lys 


Ser 


Gin 


His Pro Glu 








110 








115 






120 


Ser 


Ser 


Glu Glu 


Val 


Val 


Thr 


Leu He 


Glu 


Asp 


Leu 


Thr Gin Met 








125 








130 






135 


Leu 


Glu 


Glu Lys 


Asp 


Pro 


Val 


Ser Gin 


Asp 


Ser 


Thr 


Val Ser Gin 






140 








145 






150 


Glu 


Glu 


Asn Ser 


Lvs 


Glu 


Asp 


Lys Met 


Val 


Thr 


Val 


Cys Pro Asn 








155 






160 






165 


Thr 


Glu 


Ser Cys 


Glu 


Ser 


He 


Thr Leu 


Lys 


Asp 


Val 


Ala Val Asn 








170 








175 






180 


Phe 


Ser 


Arg Gly 


Glu 


Ttd 


Lys 


Lys Leu 


Glu 


Pro 


Phe 


Gin Lys Glu 








185 








190 






195 


Leu 


Tyr 


Lys Glu 


Val 


Leu 


Leu 


Glu Asn 


Leu 


Arg 


Asn 


Leu Glu Phe 




200 








205 






210 


Leu 


Asp 


Phe Pro 


Val 


Ser 


Lys 


Leu Glu 


Leu 


He 


Ser 


Gin -Leu Lys 






215 






220 






225 


Trp 


Val 


Glu Leu 


Pro 


Trp 


Leu 


Leu Glu 


Glu 


Val 


Ser 


Lys Ser Ser 






230 






235 






240 


Arg 


Leu 


Asp Glu 


Ser 


Ala 


Leu 


Asp Lys 


He 


He 


Glu 


Arg Cys Leu 








245 








250 






255 


Arg 


Asp 


Asp Asp 


His 


Gly 


Leu 


Met Glu 


Glu 


Ser 


Gin 


Gin Tyr Cys 








260 








265 






270 


Gly 


Ser 


Ser Glu 


Glu 


Asp 


His 


Gly Asn 


Gin 


Gly 


Asn 


Ser Lys Gly 








275 








280 






285 


Arg 


Val 


Ala Gin 


Tyr 


Lys 


Thr 


Leu Gly 


Ser 


Gly 


Ser 


Arg Gly Lys 








290 








295 






300 


Lys 


Phe 


Asp Pro 


Asp 

















305 

<210> 253 
<211> 717 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 200005 . 1 .orf 2 :2000FEB18 
<400> 253 



Glu 


Cys 


Ser 


Arg 


Val 


Thr 


Val 


Thr Glu 


His 


Trp 


Ser 


Lys Val Phe 


1 








5 








10 






15 


Pro 


Lys 


Gly 


Gin 


Gly 


Ser 


Gin 


Glu His 


Leu 


Leu 


Lys 


Leu Met Thr 










20 








25 






30 


Met 


Gly 


Asp 


Met 


Lys 


Thr 


Pro 


Asp Phe 


Asp 


Asp 


Leu 


Leu Ala Ala 










35 








40 






45 


Phe 


Asp 


He 


Pro 


Asp 


Met 


Val 


Asp Pro 


Lys 


Ala 


Ala 


He Glu Ser 










50 








55 






60 


Gly 


His 


Asp 


Asp 


His 


Glu 


Ser 


His Met 


Lys 


Gin 


Asn 


Ala His Gly 










65 








70 






75 


Glu 


Asp 


Asp 


Ser 


His 


Ala 


Pro 


Ser Ser 


Ser 


Asp 


Val 


Gly Val Ser 










80 








85 






90 


Val 


He 


Val 


Lys 


Asn 


Val 


Arg 


Asn He 


Asp 


Ser 


Ser 


Glu Gly Gly 










95 








100 






105 


Glu 


Lys 


Asp 


Gly 


His 


Asn 


Pro 


Thr Gly 


Asn 


Gly 


Leu 


His Asn Gly 










110 








115 






120 


Phe 


Leu 


Thr 


Ala 


Ser 


Ser 


Leu 


Asp Ser 


Tyr 


Ser 


Lys 


Asp Gly Ala 










125 






130 






135 


Lys 


Ser 


Leu 


Lys 


Gly 


Asp 


Val 


Pro Ala 


Ser 


Glu 


Val 


Thr Leu Lys 








140 






145 






150 


Asp 


Ser 


Thr 


Phe 


ser 


Gin 


Phe 


Ser Pro 


He 


Ser 


Ser 


Ala Glu Glu 








155 








160 






165 


Phe 


Asp 


Asp 


Asp 


Glu' Lys 


He 


Glu Val 


Asp 


Asp 


Pro 


Pro Asp Lys 










170 








175 






180 


Glu 


Asp 


Met 


Arg 


Ser 


Ser 


Phe 


Arg Ser 


Asn 


Val 


Leu 


Thr Gly Ser 
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185 



Ala 


Pro 


Gin 


Gin 


Asp 


Tyr 


Asp 










200 






Asn 


Ser 


Ser 


Lys 


Thr 


Gly 


Leu 










215 




Asn 


Lys 


Ala 


Val 


Lys 


Arg 


Glu 










230 






Ser 


Val 


Tyr 


Glu 


Pro 


Phe 


Lys 










245 






Lys 


Glu 


Ser 


Ser 


Asp 


Lys 


Val 










260 






Lys 


Leu 


Ser 


Ser 


Glu 


Lys 


Asn 










275 






Pro 


Ser 


Lys 


Thr 


Lys 


Ser 


Ser 










290 






Ala 


He 


Ala 


Ala 


Leu 


Ser 


Ala 










305 






Lys 


Glu 


Pro 


Val 


Ala 


Asn 


Ser 








320 






Glu 


Val 


Asn 


Asp 


Ser 


Pro 


Arg 










335 






Gin 


Asn 


Leu 


He 


Asp 


Gly 


Thr 










350 






Asp 


Ser 


Pro 


Arg 


Ser 


He 


Ser 










365 






Pro 


Ser 


Ser 


Pro 


Ala 


Gly 


Ser 










380 






lie 


Lys 


Thr 


He 


Lys 


Thr 


Ser 








395 






Thr 


Arg 


Val 


Leu 


Pro 


Glu 


Val 










410 






Ser 


Glu 


Gin 


Thr 


Ala 


Ser 


Val 










425 






Ser 


Ser 


Pro 


Ala 


Ser 


Ala 


Ala 










440 






Pro 


Leu 


Gin 


Ser 


Ala 


Val 


Val 










455 






Leu 


Thr 


Pro 


Lys 


Gin 


Val 


Thr 










470 






Leu 


Pro 


Val 


Ser 


Ala 


Val 


Lys 










485 




Leu 


Lys 


Leu 


Ala 


Asn 


Asn 


Thr 








500 






Ala 


Ala 


Ser 


Val 


Gin 


Ser 


Ala 










515 






Asn 


Ala 


He 


Gin 


Gin 


Gin 


Thr 










530 






Ala 


Asn 


Ala 


Lys 


Leu 


Val 


Pro 








545 






Asn 


Leu 


Leu 


Pro 


Gin 


Gly 


Ala 










560 




Val 


Leu 


Thr 


Lys 


Pro 


Gin 


Gin 








575 






Ala 


Ala 


Ala 


Ser 


Gin 


Pro 


Pro 










590 






Val 


Ser 


Ser 


Leu 


Gin 


Ser 


Ser 










605 






Leu 


Ser 


Ser 


Val 


Asn 


Pro 


Val 










620 






Pro 


Pro 


Ala 


Asn 


Ala 


Gly 


He 










635 




Cys 


Leu 


Glu 


Cys 


Gly 


Asp 


Ser 










650 






Gin 


His 


Tyr 


Asp 


Arg 


Arg 


Ser 










665 






His 


Cys 


Thr 


Lys 


Asn 


Leu 


Val 



680 







190 


195 


Lys 


Leu 


Lys Ala Leu Gly Gly 


Glu 




205 


210 


Ser 


Thr 


Ser Gly Asn Val Glu 


Lys 






220 


225 


Thr 


Glu 


Ala Ser Ser He Asn 


Leu 






235 


240 


Val 


Arg 


Lys Ala Glu- Asp Lys 


Leu 






250 


255 


Leu 


Glu 


Asn Arg Val Leu Asp 


Gly 






265 


270 


Asp 


Thr 


Ser Leu Pro Ser Val 


Ala 




280 


285 


Ser 


Lys 


Leu Ser Ser Cys He 


Ala 




295 


300 


Lys 


Lys 


Ala Ala Ser Asp Ser 


Cys 






310 


315 


Arg 


Glu 


Ser Ser Pro Leu Pro 


Lys 




325 


330 


Ala 


Ala 


Asp Lys Ser Pro Glu 


Ser 






340 


345 


Lys 


Lys 


Pro Ser Leu Lys Gin 


Pro 






355 


360 


Ser 


Glu 


Asn Ser Ser Lvs Gly 


Ser 






370 


375 


Thr 


Pro 


Ala He Pro Lys Val 


Arg 






385 


390 


Ser 


Gly 


Glu He Lys Arg Thr 


Val 






400 


405 


Asp 


Leu 


Asp Ser Gly Lys Lys 


Pro 






415 


420 


Met 


Ala 


Ser Val Thr Ser Leu 


Leu 






430 


435 


Val 


Leu 


Ser Ser Pro Pro Arg 


Ala 






445 


450 


Thr 


Asn 


Ala Val Ser Pro Ala 


Glu 






460 


465 


He 


Lys 


Pro Val Ala Thr Ala 


Phe 




475 


480 


Thr 


Ala 


Gly Ser Gin Val He 


Asn 






490 


495 


Thr 


Val 


Lys Ala Thr Val He 


Ser 






505 


510 


Ser 


Ser 


Ala He He Lys Ala 


Ala 






520 


525 


Val 


Val 


Val Pro Ala Ser Ser 


Leu 






535 


540 


Lys 


Thr 


Val His Leu Ala Asn 


Leu 




550 


555 


Gin 


Ala 


Thr Ser Glu Leu Arg 


Gin 






565 


570 


Gin 


He 


Lys Gin Ala He He 


Asn 






580 


585 


Lys 


Lys 


Val Ser Arg Val Gin 


Val 






595 


600 


Val 


Val 


Glu Ala Phe Asn Lys 


Val 






610 


615 


Pro 


Val 


Tyr He Pro Asn Leu 


Ser 






625 


630 


Thr 


Leu 


Pro Thr Arg Gly Tyr 


Lys 






640 


645 


Phe 


Ala 


Leu Glu Lys Ser Leu 


Thr 






655 


660 


Val 


Arg 


He Glu Val Thr Cys 


Asn 




670 


675 


Phe 


Tyr 


Asn Lys Cys Ser Leu 


Leu 






685 


690 
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Ser His Ala Arg Gly His Lys Glu Lys Gly Val Val Met Gin Cys 
695 700 705 

Ser His Leu He Leu Ser Gin Ser Gin Gin He Lys 
710 715 

<210> 254 
<211> 211 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LG: 1076828 .l.orfl :2000FEB18 
<220> 

<221> unsure 

<222> 25, 54, 93, 113, 116, 121 
<223> unknown or other 

<400> 254 



Thr 


Pro 


Lys Pro 


Gin 


Val 


He 


Ser 


Leu 


Leu 


Glu 


Gin Gly Lys 


Glu 


1 






5 










10 






15 


Pro 


Trp 


Met Val 


Gly 


Arg 


Glu 


Leu 


Thr 


Xaa 


Gly 


Leu Cys Ser 


Asp 








20 










25 






30 


Leu 


Glu 


Ser Met 


Cys 


Glu 


Thr 


Lys 


Leu 


Leu 


Ser 


Leu Lys Lys 


Glu 








35 










40 






45 


Val 


Tyr 


Glu He 


Glu 


Leu 


Cys 


Gin 


Xaa 


GlU 


He 


Met Gly Leu 


Thr 








50 










55 






60 


Lys 


His 


Gly Leu 


Glu 


Tyr 


Ser 


Ser 


Phe 


Gly 


Asp 


Val Leu Glu 


Tyr 








65 










70 






75 


Arg 


Ser 


His Leu 


Ala 


Lys 


Gin 


Leu 


Gly 


Tyr 


Pro 


Asn Gly His 


Phe 








80 










85 






90 


Ser 


Gin 


Xaa He 


Phe 


Thr 


Pro 


Glu 


Tyr 


Met 


Pro 


Thr Phe lie 


Gin 








95 










100 






105 


Gin 


Thr 


Phe Leu 


Thr 


Leu 


His 


Xaa 


He 


He 


Xaa 


Asn Glu Asp 


Arg 








110 










115 






120 


Xaa 


Tyr 


Glu cys 


Lys 


Glu 


Cys 


Gly 


Lys 


Met 


Phe 


Ser His Gly 


Ser 








125 










130 






135 


Gin 


Leu 


Thr Gin 


His 


Gin 


Arg 


He 


His 


Thr 


Gly 


Glu Lys Pro 


Tyr 








140 










145 






150 


Gin 


Cys 


Lys Glu 


Cys 


Gly 


Lys 


Ala 


Phe 


Asn 


Arg 


Gly Ser Leu 


Leu 








155 










160 






165 


Thr 


Arg 


His Gin 


Arg 


He 


His 


Thr 


Gly 


Glu 


Lys 


Pro Tyr Glu 


Cys 








170 










175 






180 


Lys 


Glu 


Cys Gly 


Lys 


Thr 


Phe 


Ser 


Arg 


Gly 


Ser 


Glu Leu Thr 


Gin 




185 










190 






195 


His 


Glu 


Arg He 


His 


Thr 


Ala 


Gly 


Ala 


Pro 


Leu 


Leu Ser Trp 


Gly 






200 










205 






210 



Leu 



<210> 255 
<211> 103 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: LG: 1076931 .l.orf2 :2000PEB18 
<400> 255 

Gly Gly Glu Gly Gin Ser Ala Asp Leu Arg Pro Leu Leu Thr Asp 
15 10 15 

Phe Arg Leu Gin Phe He Cys Ala Pro Ala Ser Ser Leu Ser Leu 
20 25 30 

Arg Arg Leu Arg Leu Arg Pro Gin Lys Glu He Ser He Leu Cys 
"35 40 45 



132/228 



WO 01/62927 



Pro Glu Gin Asn Arg 


Met Ala Met Ser Gin Glu Ser Leu Thr 


Phe 


50 


55 


60 


Lys Asp Val Phe Val 


Gly Phe Thr Leu Glu Glu Trp Gin Gin 


Leu 


65 


70 


75 


Asp Pro Ser Gin Arg 


Ala Leu Tyr Val Met Leu Glu 


Asn 


80 


85 


90 


Tyr Ser Asn Leu Val 


Ser Val Gly Tyr Cys Ala His Lys 




'95 


100 





<210> 256 

<211> 84 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1078121 . 1 .orf3 :2000FEB18 
<400> 256 



Glu Gly He 


Pro Glu 


Lys 


Lys 


Glu Glu Glu Glu Glu Met Ala 


Gly 


1 


5 




10 


15 


Ser Gin Gly 


Leu Leu 


He 


Phe 


Arg Asp Val Ala He Glu Phe 


Ser 




20 






25 


30 


Pro Glu Glu 


Trp Ser 


Tyr 


Leu 


Asp Pro Ala Gin Gin Asn Leu 


Tvr 




35 




40 


45 


Arg Asp Val 


Met Leu 


Glu 


Asn 


Tyr Arg Asn Leu Val Ser Leu 


Gly 


50 






55 


60 


He Ala Val 


Ser Lys 


Pro 


Glu 


Leu He Thr Cys Leu Glu Gin 


Arg 




65 






70 


75 


Asn Glu Pro 


Trp Asn 


Val 


Lys 


Lys His 






80 











<210> 257 
<211> 194 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1079203 . 1 . orf 1 : 2000FEB18 
<400> 257 



Asp 


Ala 


Arg 


Thr 


Thr 


Trp 


Lys 


Pro 


Arg 


Asn 


Val 


He Tyr Ser 


His 


1 








5 










10 






15 


Phe 


Thr 


Glu 


Asp 


Leu 


Trp 


Pro 


GlU 


His 


Ser 


lie 


Lys Asp Ser 


Phe 










20 








25 






30 


Gin 


Lys 


Val 


He 


Leu 


Arg 


Gly 


Tyr 


Gly 


Lys 


cys 


Gly His Glu 


Asn 










35 










40 






45 


Leu 


Gin 


Leu 


Arg 


He 


Ser 


Cys 


Lys 


Ser 


Val 


Asp 


Glu Ser Lys 


Val 










50 










55 






60 


Phe 


Lys 


Glu 


Gly 


Tyr 


Asn 


Glu 


Leu 


Asn 


Gin 


Cys 


Leu Arg Thr 


Thr 








65 










70 






75 


Gin 


Ser 


Lys 


He 


Phe 


Gin 


Cys 


Asp 


Lys 


Tyr 


Val 


Lys Val Phe 


His 










80 










85 






90 


Lys 


Phe 


Ser 


Asn 


Ser 


Asn 


Ser 


His 


Lys 


Lys 


Arg 


Asn Thr Gly 


Lys 










95 










100 






105 


Lys 


Val 


Phe 


Lys 


Cys 


Lys 


Glu 


Cys 


Gly 


Lys 


Ser 


Phe Cys Met 


Leu 










110 










115 






120 


Ser 


His 


Leu 


Thr 


Gin 


His 


He 


Arg 


He 


His 


Thr 


Arg Glu Asn 


Ser 










125 










130 






135 


Tyr 


Lys 


Cys 


Lys 


Glu 


Cys 


Gly 


Lys 


Val 


Leu 


Asn 


Gin Ser Ser 


Glu 










140 










145 






150 


Leu 


He 


Lys 


His 


Lys 


Lys 


He 


His 


Thr 


Gly 


Glu 


Lys Pro Tyr 


Thr 










155 










160 






165 


Cys 


Glu 


Lys 


Cys 


Gly 


Lys 


Thr 


Phe 


Asn 


Gin 


Ser 


Ala Asn Leu 


Tyr 










170 










175 






180 


Ala 


His 


Lys 


Lys 


He 


His 


Thr 


Gly 


Asp 


Lys 


Thr 


He Gin Val 
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185 190 

<210> 258 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1082586 . 1 .or fl :2000FEB18 
<220> 

<221> unsure 
<222> 5 

<223> unknown or other 
<400> 258 



Ala 


Gin 


Pro 


Arg Xaa 


Pro 


Met 


Gly 


Gin 


Tyr 


Gin 


Ala Asp Gin 


Tyr 


1 






5 










10 






15 


He 


His 


Arg 


Arg Ser 


Ser 


Ser 


Arg 


Arg 


Glu 


Lys 


Gly Ala Glu 


Arg 








20 










25 






30 


He 


Leu 


Glu 


Glu He 


Met 


Ala 


Glu 


Asn 


Phe 


Ser 


Ser Leu He 


Lys 








35 










40 






45 


Asp 


Met 


Asn 


He Asn 


He 


Gin 


Glu 


Ala 


Gin 


Gin 


Thr Pro Ser 


Met 






50 










55 






60 


Met 


Asn 


Ser 


Lys He 


Ala 


Thr 


Leu 


Arg 


His 


He 


He He Lys 


Leu 








65 










70 






75 


Ser 


Lys 


Asp 


Lys His 


Arg 


Pro 


Leu 


Ser 


Leu 


Thr 


Ala Ala Arg 


Ala 








80 










85 






90 


Pro 


Ser 


Leu 


Val Phe 


Thr 


Ser 


Leu 


Phe 


Leu 


Leu 


Leu Leu Glu 


Ala 








95 










100 






105 


Gin 


Pro 


Leu 


Trp Pro 


Cys 


Asp 


Leu 


Gin 


Val 


Leu 


Gly Asp Pro 


Leu 








110 










115 






120 


Leu 


Arg 


Cys 


Gin Asp 


Pro 


Leu 


Glu 


Ala 











125 

<210> 259 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1082774 . 1 .orf 3 :2000FEB18 
<400> 259 



Pro 


Ala 


Gly 


He 


Arg 


Arg Val Thr Ala 


Arg 


Thr Pro Gly Pro 


Pro 


1 








5 




10 




15 


Gly 


Ser 


Leu 


Glu 


Met 


Gly Pro Leu Gin 


Phe 


Arg Asp Val Ala 


He 








20 


25 




30 


Glu 


Phe 


Ser 


Leu 


Glu 


Glu Trp His Cys 


Leu 


Asp Ala Ala Gin 


Arg 










35 


40 


45 


Asn 


Leu 


Tyr 


Arg 


Asp 


Val Met Leu Glu 


Asn 


Tyr Arg Asn Leu 


He 










50 




55 




60 


Phe 


Leu 


Gly 


He 


Val 


Val Ser Lys Pro 


Asn 


Leu He Thr Cys 


Leu 








65 


70 


75 


Glu 


Gin 


Gly 


Lys 


Lys 


Pro Leu Thr Met 


Lys 


Arg His Glu Met 


He 










80 




85 




90 


Ala 


Lys 


Pro 















<210> 260 
<211> 193 
<212> PRT 
<213> Homo sapiens 

<220> 



134/228 



WO 01/62927 



PCT/US01/06059 



<221> misc_feature 

<223> Incyte ID No: LG: 1082775 .1 .or f 3 :2000FEB18 
<400> 260 



Lys 


His 


Glu lie 


He 


His Phe Glu Glu 


Glu 


Pro 


Ser Glu Tyr 


Asn 


1 






5 




10 




15 


Asn 


Asn 


Gly Asn 


Ser 


Phe Trp Leu Asn 


Glu 


Asp 


Leu He Trp 


His 








20 




25 






30 


Gin 


Lys 


He Lys 


Asn 


Trp Glu Gin Pro 


Phe 


Glu 


Tyr Asn Glu 


Cys 








35 




40 






45 


Gly 


Lys 


Ala Phe 


Pro 


Glu Asn Ser Leu 


Phe 


Leu 


Val His Lys 


Arg 








50 




55 






60 


Ala 


Tyr 


Thr Gly 


Gin 


Lys Thr Cys Lys 


Tyr 


Thr 


Glu His Gly 


Lys 








65 




70 






75 


Thr 


Cys 


Tyr Met 


Ser 


Phe Phe He Thr 


His 


Gin 


Gin Thr His 


Pro 








80 




85 






90 


Arg 


Glu 


Asn His 


Tyr 


Glu Cys Asn Glu 


Cys 


Gly 


Glu Ser He 


Phe 








95 




100 






105 


Glu 


Glu 


Ser He 


Leu 


Phe Glu His Gin 


Asn 


Val 


Tyr Pro Phe 


Ser 








110 




115 




120 


Gin 


Asn 


Leu Asn 


Pro 


Thr Leu He Gin 


Arg 


Thr 


His Ser He 


Ser 








125 




130 






135 


Asn 


He 


He Glu 


Tyr 


Asn Glu Cys Gly 


Thr 


Phe 


Phe Ser Glu 


Lys 








140 




145 






1 crt 


Leu 


Ala 


Leu His 


Leu 


Gin Gin Arg Thr 


His 


Pro 


Gly Glu Lys 


Pro 








155 




160 




165 


Tyr 


Glu 


Cys His 


Glu 


Cys Gly Lys Thr 


Phe 


Thr 


Gin Lys Ser 


Ala 








170 




175 






180 


His 


Thr 


Arg His 


Gin 


Arg Thr His Thr 


Gly 


Lys 


Thr Leu 










185 




190 









<210> 261 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1083120 . 1 . orf3 :2000FEB18 
<400> 261 



Pro Gly 


Leu Arg 


Asp 


Leu 


Thr Cys Lys 


Glu 


Leu 


Leu He Leu 


Thr 


1 




5 






10 






15 


Glu Arg 


Glu Ala 


Gin 


Lys 


Arg Lys Lys 


Arg 


Lys 


Glu Lys Glu 


Ser 






20 






25 






30 


Gly Met 


Ala Leu 


Thr 


Gin 


Gly Pro Leu 


Thr 


Phe 


Arg Asp Val 


Ala 






35 




40 




45 


He Glu 


Phe Ser 


Gin 


Glu 


Glu Trp Lys 


Ser 


Leu 


Asp Pro Val 


Gin 






50 






55 






60 


Lys Ala 


Leu Tyr 


Trp 


Asp 


Val Met Leu 


Glu 


Asn 


Tyr Arg Asn 


Leu 






65 






70 






75 


Val Phe 


Leu Gly 


Lys 


Asp 


Asn Phe Ala 


Leu 


Glu 


Val Lys He 


Cys 






80 






85 






90 


Pro Arg 


Val Phe 


Leu 


Tyr 


Phe Leu Cys 


Cys 


Leu 


Ser Val Gly 


Ala 






95 






100 




105 


Arg Ser 


He His 


Leu 


His 
















110 















<210> 262 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1087707 . 1 .orf 3 :2000FEB18 



135/228 



WO 01/62927 



PCT/DS01/06059 



<400> 262 



Leu 


His 


His Ser 


Pro 


Cys 


Tyr 


1 






5 


lie 


His 


Arg Glu 


Glu 
20 


Gly 


Gly 


Val 


Leu 


Thr Phe 


Arg 
35 


Asp 


Val 


Trp 


Glu 


Cys Leu 


Asp 
50 


Ser 


Ala 


Met 


Leu 


Glu Asn 


Tyr 
65 


Gly 


Asn 


Phe 


Lys 


Pro Asp 


Leu 
80 


He 


Thr 


Trp 


Asn 


Ala Arg 


Arg 
95 


Gin 


Lys 


Tyr 


Asp 


Val Cys 


Asn 
110 


Glu 


Asp 


Phe 


Leu 


Asn Ser 


Glu 
125 


Gin 


Leu 


Phe 


Cys 











Pro 


Val 


Thr 


Cys 


Arg Tyr Trp Asp 






10 




15 


Thr 


Ser 


Gly 


Gly 


Trp Glu Met Arg 






25 




30 


Ala 


Val 


Glu 


Phe 


Ser Pro Glu Glu 






40 




45 


Gin 


Gin 


Arg 


Leu 


Tyr Arg Asp Val 






55 




60 


Leu 


Phe 


Ser 


Leu 


Gly Leu Ala He 






70 




75 


Tyr 


Leu 


Glu 


Gin 


Arg Lys Glu Pro 






85 




90 


Thr 


Val 


Ala 


Lys 


His Pro Asp Tyr 






100 




105 


Tyr 


Glu 


Tyr 


Asn 


Trp Ser Tyr Met 






115 




120 


Phe 


He 


Lys 


Phe 


Tyr Pro Thr Phe 






130 




135 



<210> 263 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1090915 . 1 .orf 3 :2000FEB18 
<400> 263 

Met Phe Pro Val Leu Glu Pro His Gin Val Gly Leu He Arg Ser 
15 10 15 

Tyr Asn Ser Lys Thr Met Thr Cys Phe Gin Glu Leu Val Thr Phe 

20 25 30 

Arg Asp Val Ala He Asp Phe Ser Arg Gin Glu Trp Glu Cys Leu 

35 40 "*" 45 

Asp Pro Asn Gin Arg Asp Leu Tyr Arg Asp Val Met Leu Glu Asn 

50 55 60 

Tyr Arg Asn Leu Val Ser Leu Gly 

65 

<210> 264 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1094230 . 1 .orf 3 :2000FEB18 
<400> 264 



Asp Thr 


Gly 


Thr Ser 


Trp 


Lys 


Pro 


Lys 


Met 


Gly 


Pro Leu Gin Phe 


1 




5 










10 




15 


Arg Asp 


Val 


Ala He 
20 


Glu 


Phe 


Ser 


Leu 


Glu 
25 


Glu 


Trp His Cys Leu 
30 


Asp Thr 


Ala 


Gin Arg 
A 35 


Asn 


Leu 


Tyr 


Arg 


Asn 


Val 


Met Leu Glu Asn 












40 




45 


Tyr Ser 


Asn 


Leu Val 


Phe 


Leu 


Gly 


He 


Thr 


Val 


Ser Lys Pro Asp 




50 








55 




60 


Leu He 


Thr 


Cys Leu 
65 


Glu 


Gin 


Gly 


Arg 


Lys 
70 


Pro 


Leu Thr Met Lys 
75 


Arg Asn 


Glu 


Met He 


Ala 


Lys 


Pro 


Ser 


Val 


Ser 


Phe Leu Gin Val 




80 










85 




90 


His Ser 


Glu 


Ser Gin 
95 


Ser 


Pro 


Leu 


His 


Asp 
100 


He 





136/228 



WO 01/62927 



<210> 265 

<211> 96 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 474848 .3 .orf 1:200 0FEB18 
<400> 265 



Ser 


Ala 


Ala Met 


Phe 


Pro 


Val 


Phe Ser 


Gly 


Cys 


Phe Gin 


Glu 


Leu 


1 






5 








10 








15 


Gin 


Glu 


Lys Asn 


Lys 
20 


Ser 


Leu 


Glu Leu 


Val 
25 


Ser 


Phe Glu 


Glu 


Val 
30 


Ala 


Val 


His Phe 


Thr 


Trp 


Glu 


Glu Trp 


Gin 


Asp 


Leu Asp Asp 


Ala 








35 








40 








45 


Gin Arg 


Thr Leu 


Tyr 


Arg 


Asp 


Val Met 


Leu 


Glu 


Thr Tyr 


Ser 


Ser 








50 








55 








60 


Leu 


Val 


Ser Leu 


Gly 
65 


His 


Cys 


He Thr 


Lys 
70 


Pro 


Glu Met 


He 


Phe 
75 


Lys 


Leu 


Glu Gin 


Gly 
80 


Ala 


GlU 


Pro Trp 


He 
85 


Val 


Glu Glu 


Thr 


Leu 
90 


Asn 


Leu 


Arg Leu 


Ser 
95 


Gly 

















<210> 266 
<211> 251 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :251656 . 1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 234 

<223> unknown or other 
<400> 266 



GlU 


Asn 


Gly 


Glu 


Asn 


Cys 


Asn 


Gin Asp 


Met 


Phe 


Glu Asn Glu 


Ser 


1 








5 








10 






15 


Arg 


Lys 


He 


Phe 


Ser 
20 


Glu 


Met 


Pro Glu 


Gly 
25 


Glu 


Ser Ala Gin 


His 
30 


Ser 


Asp 


Gly 


Glu 


Ser 
35 


Asp 


Phe 


Glu Arg 


Asp 
40 


Ala 


Gly He Gin 


Arg 
45 


Leu 


Gin 


Gly 


His 


Thr 
50 


Pro 


Gly 


Glu Asp 


His 
55 


Gly 


Glu Val Val 


Ser 
60 


Gin 


Asp 


Arg 


GlU 


Val 
65 


Gly 


Gin 


Leu He 


Gly 
70 


Leu 


Gin Gly Thr 


Tyr 
75 


Leu 


Gly 


Glu 


Lys 


Pro 
80 


Tyr 


Glu 


Cys Pro 


Gin 
85 


Cys 


Gly Lys Thr 


Phe 
90 


Ser 


Arg 


Lys 


Ser 


His 
95 


Leu 


He 


Thr His 


Glu 
100 


Arg 


Thr His Thr 


Gly 
105 


Glu 


Lys 


Tyr 


Tyr 


Lys 
110 


Cys 


Asp 


Glu Cys 


Gly 
115 


Lys 


Ser Phe Ser 


Asp 
120 


Gly 


Ser 


Asn 


Phe 


Ser 


Arg 


His 


Gin Thr 


Thr 


His 


Thr Gly Glu 


Lys 










125 






130 




135 


Pro 


Tyr 


Lys 


Cys 


Arg 
140 


Asp 


Cys 


Gly Lys 


Ser 
145 


Phe 


Ser Arg Ser 


Ala 
150 


Asn 


Leu 


lie 


Thr 


His 
155 


Gin 


Arg 


He His 


Thr 
160 


Gly 


Glu Lys Pro 


Phe 
165 


Gin 


Cys 


Ala 


Glu 


Cys 
170 


Gly 


Lys 


Ser Phe 


Ser 
175 


Arg 


Ser Pro Asn 


Leu 
180 


He 


Ala 


His 


Gin 


Arg 
185 


Thr 


His 


Thr Gly 


Glu 
190 


Lys 


Pro Tyr Ser 


Cys 
195 


Pro 


Glu 


Cys 


Gly 


Lys 


Ser 


Phe 


Gly Asn 


Arg 


Ser 


Ser Leu Asn 


Thr 



137/228 



WO 01/62927 



PCT/US01/06059 



200 205 210 

His Gin Gly He His Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu 

215 220 225 

Cys Gly Glu Ser Phe Ser Tyr Asn Xaa Asn Leu He Arg Gin Gin 

230 235 240 

Arg He His Thr Gly Glu Lys Pro Tyr Lys Cys 

245 250 



<210> 267 
<211> 522 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI: 021371 . 1 ,orf3 :2000FEB01 
<400> 267 



Pro 


Gly 


Met 


Ser 


Val 


Ala 


Gly 


Val 


Glu 


Gly 


Glu 


Pro 


Leu Val Ser 


1 






5 








10 






15 


Ser 


Gin 


Ser 


Gly 


Gin 
20 


Ser 


Pro 


Pro 


Glu 


Pro 
25 


Gin 


Asp 


Pro Glu Ala 
30 


Pro 


Ser 


Ser 


Ser 


Gly 
35 


Pro 


Gly 


His 


Leu 


Val 
40 


Ala 


Met 


Gly Lys Val 
45 


Ser 


Arg 


Thr 


Pro 


Val 
50 


Glu 


Ala 


Gly 


Val 


Ser 
55 


Gin 


Ser 


Asp Ala Glu 
60 


Asn 


Ala 


Ala 


Pro 


Ser 
65 


Cys 


Pro 


Asp 


Glu 


His 
70 


Asp 


Thr 


Leu Pro Arg 
75 


Arg 


Arg 


Gly 


Arg 


Pro 
80 


Ser 


Arg 


Arg 


Phe 


Leu 
85 


Gly 


Lys 


Lys Tyr Arg 
90 


Lys 


Tyr 


Tyr 


Tyr 


Lys 
95 


Ser 


Pro 


Lys 


Pro 


Leu 
100 


Leu 


Arg 


Pro Phe Leu 
105 


cys 


Arg 


He 


Cys 


Gly 
110 


Ser 


Arg 


Phe 


Leu 


Ser 
115 


His 


Glu 


Asp Leu Arg 
120 


Phe 


His 


Val 


Asn 


Ser 
125 


His 


Glu 


Ala 


Gly 


Asp 
130 


Pro 


Gin 


Leu Phe Lys 
135 


Cys 


Leu 


Gin 


Cys 


Ser 
140 


Tyr 


Arg 


Ser 


Arg 


Arg 
145 


Trp 


Ser 


Ser Leu Lys 
150 


Glu 


His 


Met 


Phe 


Asn 
155 


His 


Val 


Gly 


Ser 


Lys 
160 


Pro 


Tyr 


Lys Cys Asp 
165 


Glu 


Cys 


Ser 


Tyr 


Thr 
170 


Ser 


Val 


Tyr 


Arg 


Lys 
175 


Asp 


Val 


He Arg His 
180 


Ala 


Ala 


Val 


His 


Ser 
185 


Arg 


Asp 


Arg 


Lys 


Lys 
190 


Arg 


Pro 


Asp Pro Thr 
195 


Pro 


Lys 


Leu 


Ser 


Ser 
200 


Phe 


Pro 


cys 


Pro 


Val 
205 


Cys 


Gly 


Arg Val Tyr 
210 


Pro 


Met 


Gin 


Lys 


Arg 
215 


Leu 


Thr 


Gin 


His 


Met 
220 


Lys 


Thr 


His Ser Thr 
225 


Glu 


Lys 


Pro 


His 


Met 
230 


Cys 


Asp 


Lys 


Cys 


Gly 
235 


Lys 


Ser 


Phe Lys Lys 
240 


Arg 


Tyr 


Thr 


Phe 


Lys 
245 


Met 


His 


Leu 


Leu 


Thr 
250 


His 


He 


Gin Ala Val 
255 


Ala 


Asn 


Arg 


Arg 


Phe 
260 


Lys 


Cys 


Glu 


Phe 


Cys 
265 


Glu 


Phe 


Val Cys Glu 
270 


Asp 


Lys 


Lys 


Ala 


Leu 
275 


Leu 


Asn 


His 


Gin 


Leu 
280 


Ser 


His 


Val Ser Asp 
285 


Lys 


Pro 


Phe 


Lys 


Cys 


Ser 


Phe 


Cys 


Pro 


Tyr 


Arg 


Thr 


Phe Arg Glu 








290 










295 






300 


Asp 


Phe 


Leu 


Leu 


Ser 
305 


His 


Val 


Ala 


Val 


Lys 
310 


His 


Thr 


Gly Ala Lys 
315 


Pro 


Phe 


Ala 


Cys 


Glu 
320 


Tyr 


Cys 


His 


Phe 


Ser 
325 


Thr 


Arg 


His Lys Lys 
330 


Asn 


Leu 


Arg 


Leu 


His 


Val 


Arg 


Cys 


Arg 


His 


Ala 


Ser 


Ser Phe Glu 










335 








340 






345 


Glu 


Trp 


Gly 


Arg 


Arg 
350 


His 


Pro 


Glu 


Glu 


Pro 
355 


Pro 


Ser 


Arg Arg Arg 
360 


Pro 


Phe 


Phe 


ser 


Leu 


Gin 


Gin 


He 


GlU 


Glu 


Leu 


Lys 


Gin Gin His 



138/228 



WO 01/62927 



PCTYUS01/06059 









365 








370 






375 


Ser 


Ala 


Ala Pro 


Gly 
380 


Pro 


Pro 


Pro Ser 


Ser 
385 


Pro 


Gly Pro Pro 


GIU 
390 


lie 


Pro 


Pro Glu 


Ala 
395 


Thr 


Thr 


Phe Gin 


Ser 
400 


Ser 


Glu Ala Pro 


Ser 
405 


Leu 


Leu 


Cys Ser 


Asp 


Thr 


Leu 


Gly Gly 


Ala 


Thr 


He He Tyr 


Gin 






410 








415 






420 


Gin 


Gly 


Ala Glu 


Glu 


Ser 


Thr 


Ala Met 


Ala 


Thr 


Gin Thr Ala 


Leu 






425 








430 






435 


Asp 


Leu 


Leu Leu 


Asn 


Met 


Ser 


Ala Gin 


Arg 


Gly 


Pro Gly Gly 


Thr 






440 








445 






450 


Ala 


Leu 


Gin Val 


Cys 


Cys 


Leu 


Gly Thr 


Cys 


Ser 


Pro Ser Gin 


Leu 








455 




460 






465 


Pro 


Gin 


Tyr Pro 


Ala 


Leu 


His 


Trp Thr 


Leu 


Gly 


Leu Glu Glu 


Asn 






470 






475 






480 


Ser 


Val 


Ser Glu 


Leu 


Leu 


Arg 


Pro Trp 


Gly 


Leu 


Pro Gly Ser 


Gly 








485 






490 






495 


Gly 


Asp 


Arg Ser 


Ala 
500 


Glu 


Val 


Trp Trp 


Ala 
505 


Asn 


Arg Glu Glu 


Gin 
510 


Ala 


Leu 


Pro Arg 


Arg 
515 


Pro 


Gin 


Gly He 


Pro 
520 


Ser 


He 





<210> 268 
<211> 267 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 133095 . 1 .orf 2 :2000PEB01 
<220> 

<221> unsure 
<222> 36 

<223> unknown or other 
<400> 268 



Gly 


Leu 


Gly 


Glu Glu 


Val 


Pro Cys Ala 


Met 


Met 


Glu Gly Val 


Ala 


1 




5 








10 






15 


Ala 


Tyr 


Thr 


Gin Thr 


Glu 


Pro Glu Gly 


Ser 


Gin 


Pro Ser Thr 


Met 






20 








25 






30 


Asp 


Ala 


Thr 


Ala Val 


Xaa 


Gly He Glu 


Thr 


Lys 


Lys Glu Lys 


Glu 






35 








40 






45 


Asp 


Leu 


Cys 


Leu Leu 


Lys 


Lys 


Glu Glu 


Lys 


Glu 


Glu Pro Val 


Ala 




50 




55 






60 


Pro 


Glu 


Leu 


Ala Thr 


Thr 


Val 


Pro Glu 


Ser 


Ala 


Glu Pro Glu 


Ala 








65 








70 






75 


Glu 


Ala 


Asp 


Gly Glu 


Glu 


Leu 


Asp Gly 


Ser 


Asp 


Met Ser Ala 


He 








80 








85 






90 


He 


Tyr 


Glu 


He Pro 


Lys 


Glu 


Pro Glu 


Lys 


Arg 


Arg Arg Ser 


Lys 








95 






100 






105 


Arg 


Ser 


Arg 


Val Met 


Asp 


Ala 


Asp Gly 


Leu 


Leu 


Glu Met Phe 


His 








110 








115 






120 


Cys 


Pro 


Tyr 


Glu Gly 


Cys 


Ser 


Gin Val 


Tyr 


Val 


Ala Leu Ser 


Ser 






125 






130 






135 


Phe 


Gin 


Asn 


His Val 


Asn 


Leu 


Val His 


Arg 


Lys 


Gly Lys Thr 


Lys 








140 








145 






150 


Val 


Cys 


Pro 


His Pro 


Gly 


Cys 


Gly Lys 


Lys 


Phe 


Tyr Leu Ser 


Asn 








155 








160 






165 


His 


Leu 


Arg 


Arg His 


Met 


He 


lie His 


Ser 


Gly 


Val Arg Glu 


Phe 






170 








175 






180 


Thr 


Cys 


Glu 


Thr Cys 


Gly 


Lys 


Ser Phe 


Lys 


Arg 


Lys Asn His 


Leu 








185 






190 






195 


Glu 


Val 


His 


Arg Arg 


Thr 


His 


Thr Gly 


Glu 


Thr 


Pro Leu Gin 


Cys 








'200 






205 






210 


Glu 


He 


cys 


Gly Tyr 


Gin 


cys 


Arg Gin 


Arg 


Ala 


Ser Leu Asn 


Trp 








215 








220 






225 



139/228 



WO 01/62927 



PCT/US01/06059 



His Met Lys Lys His Thr Ala Glu Val Gin Tyr Asn Phe Thr Cys 
230 235 240 

Asp Arg Cys Gly Lys Arg Phe Glu Lys Leu Asp Ser Val Lys Phe 
245 2.50 255 

His Thr Leu Lys Ser His Pro Asp His Lys Pro Thr 
260 265 

<210> 269 
<211> 286 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:23 6654.2.orf2:2000FEB01 
<400> 269 



Arg 


Pro 


Leu 


Pro Ala 


Asp 


Leu 


Pro 


Val 


Glv 


Glv 


His His Cys 


Leu 


1 






5 










10 






15 


His 


Glv 


Pro 


Gin Glu 


Ala 


Glv 


Leu 


Ser 


Ala 


Leu 


Gin Arg Pro 


Gin 






20 








25 






30 


Pro 


Arcr 


Pro 


Gly Leu 


Arcr 


Thr 


Arcr 


Gly 


Ala 


Glu 


Gly Leu Glu 


Leu 








35 










40 






45 


Pro 


Ala 


Leu 


Trp Gin 


Thr 


Val 


His 


Ser 


Glv 

u 


Leu 


Glu Ala Ala 


Ala 








50 










55 






60 


Ser 


Arc 


Pro 


Val Gly 


Pro 


Arcr 


Thr 


Val 


His 


Leu 


Pro Asp Arg 


He 






65 








70 






75 


Arg 


Gly 


Pro 


Gly Gly 


Pro 


Val 


Leu 


Gly 


Leu 


Ala 


Glu Val Ala 


Ala 






80 










85 






90 


Ala 


Val 


Ser 


Ala Val 


Val 


Gly 


Pro 


Ala 


Ala 


Glu 


Ala Lys Ser 


Pro 








95 








100 






105 


Arg 


Ala 


Ser 


Gly Ser 


Gly 


Leu 


Thr 


Arg 


Arg 


Ser 


Pro Pro Val 


Leu 








110 










115 






120 


Cys 


Ala 


Arg 


Arg Pro 


Ser 


Ala 


Pro 


Ser 


Ala 


Thr 


Ser Lys Cys 


Thr 




125 










130 






135 


Cys 


Ala 


His 


Thr Gin 


Ala 


Ser 


Gly 


Pro 


Met 


Leu 


Ala Thr Ser 


Val 






140 








145 






150 


Pro 


Thr 


Pro 


Ala Pro 


Arg 


Ala 


Ala 


Ser 


Ser 


Thr 


Ala Thr Arg 


Arg 








155 








160 






165 


Pro 


Thr 


Gly 


Arg Cys 


Arg 


Pro 


Arg 


Ala 


Pro 


Ser 


Trp Pro Thr 


Pro 








170 










175 






180 


Ala 


Arg 


Ser 


Arg Pro 


Leu 


Gin 


Pro 


Leu 


Arg 


Ser 


Arg Leu Ser 


Met 








185 










190 






195 


Leu 


Leu 


Pro 


Pro Pro 


Ala 


Pro 


Phe 


His 


Ala 


Ala 


Val Val Arg 


Gly 








200 










205 






210 


Leu 


Glu 


Pro 


Pro Pro 


Gin 


Gin 


Val 


Ser 


Arg 


Asn 


Pro Gly Leu 


Leu 








215 










220 






225 


Ala 


Val 


Gly 


Leu Lys 


Pro 


Ala 


Leu 


Val 


Glu 


Thr 


Leu Gly Glu 


Pro 








230 










235 






240 


Ser 


Pro 


Arg 


Asn Lys 


Glu 


Leu 


Thr 


Leu 


Gin 


Thr 


Ala Arg Arg 


His 






245 










250 






255 


His 


Pro 


Lys 


Arg Cys 


Pro 


Ser 


Gin 


Gly 


Ala 


Arg 


Ala Ala Gly 


Pro 






260 










265 






270 


Gly 


Ala 


Ala 


Val Ser 


Ser 


Ala 


Gly 


Ser 


He 


Leu 


Pro Thr Ala 


Ala 



275 280 285 

Thr 



<210> 270 
<211> 194 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI -.200009 . 1 .orf 3 :2000FEB01 



140/228 



WO 01/62927 



PC17US01/06059 



<400> 270 



Gly 


Leu 


Ser Pro 


Lys 


Ala 


Ala 


Asn Leu 


Ala 


Pro 


Thr 


Thr 


Gin 


Gin 


1 






5 








10 










15 


Arg 


Ser 


Val Val 


Phe 


Pro 


Gin 


Thr Pro 


Cys 


Ser 


Arg 


Asn 


Phe 


Ser 








20 








25 










30 


Leu 


Leu 


Asp Lys 


Ser 


Gly 


Pro 


He Glu 


Ser 


Gly 


Phe 


Asn 


Gin 


He 








35 






40 










45 


Asn 


Val 


Lys Asn 


Gin 


Arg 


Val 


Leu Ala 


Ser 


Pro 


Thr 


Ser 


Thr 


Ser 






50 






55 










60 


Gin 


Leu 


His Ser 


Glu 


Phe 


Ser 


Asp Trp 


His 


Leu 


Trp 


Lys 


Cys 


Gly 








65 








70 










75 


Gin 


Cys 


Phe Lys 


Thr 


Phe 


Thr 


Gin Arg 


He 


Leu 


Leu 


Gin 


Met 


His 






80 








85 










90 


Val 


Cys 


Thr Gin 


Asn 


Pro 


Asp 


Arg Pro 


Tyr 


Gin 


Cys 


Gly 


His 


Cys 








95 








100 










105 


Ser 


Gin 


Ser Phe 


Ser 


Gin 


Pro 


Ser Glu 


Leu 


Arg 


Asn 


His 


Val 


Val 








110 








115 










120 


Thr 


His 


Ser Ser 


Asp 


Arg 


Pro 


Phe Lys 


Cys 


Gly 


Tyr 


Cys 


Gly 


Arg 








125 








130 










135 


Ala 


Phe 


Ala Gly 


Ala 


Thr 


Thr 


Leu Asn 


Asn 


His 


He 


Arg 


Thr 


His 








140 








145 








150 


Thr 


Gly 


Glu Lys 


Pro 


Phe 


Lys 


Cys Glu 


Arg 


Cys 


Glu 


Arg 


Ser 


Phe 








155 








160 










165 


Thr 


Gin 


Ala Thr 


Gin 


Leu 


Ser 


Arg His 


Gin 


Arg 


Met 


Pro 


Asn 


Glu 








170 








175 








180 


Cys 


Lys 


Pro lie 


Thr 


Glu 


Ser 


Pro Glu 


Ser 


He 


Glu 


Val 


Asp 





185 190 

<210> 271 
<211> 263 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:758502. l.orf3 :2000FEB01 
<400> 271 



Thr 


Leu 


He 


Lys 


His 


Gin 


Arg 


Thr His 


Thr 


Gly 


Glu 


Arg 


Pro 


Tyr 


1 








5 








10 










15 


Glu 


cys 


Pro 


Glu 


Cys 


Gly 


Lys 


Thr Phe 


Gly 


Arg 


Lys 


Pro 


His 


Leu 










20 








25 










30 


He 


Met 


His 


Gin 


Arg 


Thr 


His 


Thr Gly 


Glu 


Lys 


Pro 


Tyr 


Ala 


Cys 










35 








40 










45 


Leu 


Glu 


Cys 


His 


Lys 


Ser 


Phe 


Ser Arg 


Ser 


Ser 


Asn 


Phe 


He 


Thr 










50 








55 










60 


His 


Gin 


Arg 


Thr 


His 


Thr 


Gly 


Val Lys 


Pro 


Tyr 


Arg 


cys 


Asn 


Asp 










65 








70 










75 


Cys 


Gly 


Glu 


Ser 


Phe 


Ser 


Gin 


Ser Ser 


Asp 


Leu 


He 


Lys 


His 


Gin 








80 








85 








90 


Arg 


Thr 


His 


Thr 


Gly 


Glu 


Arg 


Pro Phe 


Lys 


Cys 


Pro 


Glu 


Cys 


Gly 










95 








100 










105 


Lys 


Gly 


Phe 


Arg 


Asp 


ser 


Ser 


His Phe 


Val 


Ala 


His 


Met 


Ser 


Thr 










110 








115 










120 


His 


Ser 


Gly 


Glu 


Arg 


Pro 


Phe 


Ser Cys 


Pro 


Asp 


Cys 


His 


Lys 


Ser 










125 








130 










135 


Phe 


Ser 


Gin 


Ser 


Ser 


His 


Leu 


Val Thr 


His 


Gin 


Arg 


Thr 


His 


Thr 










140 








145 








150 


Gly 


Glu 


Arg 


Pro 


Phe 


Lys 


Cys 


Glu Asn 


Cys 


Gly 


Lys 


Gly 


Phe 


Ala 










155 








160 








165 


Asp 


Ser 


Ser 


Ala 


Leu 


Thr 


Lys 


His Gin 


Arg 


He 


His 


Thr 


Gly 


Glu 










170 








175 








180 


Arg 


Pro 


Tyr 


Lys 


Cys 


Gly 


Glu 


Cys Gly 


Lys 


Ser 


Phe 


Asn 


Gin 


Ser 










185 








190 










195 


Ser 


His 


Phe 


lie 


Thr 


His 


Gin 


Arg He 


His 


Leu 


Gly 


Asp 


Arg 


Pro 










200 








205 








210 


Tyr 


Arg 


Cys 


Pro 


Glu 


Cys 


Gly 


Lys Thr 


Phe 


Asn 


Gin 


Arg 


Ser 


His 



141/228 



WO 01/62927 



215 220 225 

Phe Leu Thr His Gin Arg Thr His Thr Gly Glu Lys Pro Phe His 

230 235 240 

Cys Ser Lys Cys Asn Lys Ser Phe Arg Gin Lys Ala His Leu Leu 

245 250 255 

Cys His Gin Asp Thr His Leu lie 

260 

<210> 272 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 344772 . 1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 142 

<223> unknown or other 
<400> 272 



Ala 


Lys 


Asn Leu 


Phe 


Lys Met Asp lie Glu 


Asp 


Cvs Asn Gly Arg 


i 






5 


10 




15 


• Ser 


Tyr 


Val Ser 


Gly 


Ser Gly Asp Ser Ser 


Leu 


Glu Lys Glu Phe 








20 


25 




30 


Leu 


Gly 


Ala Pro 


Val 


Gly Pro Ser Val Ser 


Thr 


Pro Asn Ser Gin 








35 


40 




45 


His 


Ser 


Ser Pro 


Ser 


Arg Ser Leu Ser Ala 


Asn 


Ser He Lys Val 








50 


55 




60 


Glu 


Met 


Tyr Ser 


Asp 


Glu Glu Ser Ser Arg 


Leu 


Leu Gly Pro Asp 








65 


70 




75 


Glu 


Arg 


Leu Leu 


Glu 


Lys Asp Asp Ser Val 


He 


Val Glu Asp Ser 








80 


85 




90 


Leu 


Ser 


Glu Pro 


Leu 


Gly Tyr Cys Asp Gly 


Ser 


Gly Pro Glu Pro 








95 


100 




105 


His 


Ser 


Pro Gly 


Gly 


lie Arg Leu Pro Asn 


Gly 


Lys Leu Lys cys 








110 


115 




120 


Asp 


Val 


Cys Gly 


Met 


Val Cys lie Gly Pro 


Asn 


Val Leu Met Val 








125 


130 




135 


His 


Lys 


Arg Ser 


His 


Thr Xaa 












140 









<210> 273 
<211> 164 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :789445 . 1 . orf 2 :2000FEB01 
<400> 273 



GlU 


His 


Arg 


Glu 


Ala 


Lys 


Ala 


Ser 


Gly 


Trp 


Val 


Thr Asp Gly Leu 


1 








5 










10 




15 


Leu 


Met 


Asp 


Ser 


Ser 


Gin 


His 


Leu 


Val 


Thr 


Phe 


Glu Asp Val Ala 










20 










25 




30 


Val 


Asp 


Phe 


Thr 


Gin 


Glu 


Glu 


Trp 


Thr 


Leu 


Leu 


Asp Gin Ala Gin 










35 










40 




45 


Arg 


Asp 


Leu 


Tyr 


Arg 


Asp 


Val 


Met 


Leu 


Glu 


Asn 


Tyr Lys Asn Leu 










50 










55 




60 


He 


He 


Leu 


Ala 


Gly 


Ser 


Glu 


Leu 


Phe 


Lys 


Arg 


Ser Leu Met Ser 










65 










70 




75 


Gly 


Leu 


Glu 


Gin 


Met 


GlU 


Glu 


Leu 


Arg 


Thr 


Gly 


Val Thr Gly Val 










80 










85 




90 


Leu 


Gin 


Glu 


Leu 


Asp 


Leu 


Gin 


Leu 


Lys 


Thr 


Lys 


Gly Ser Pro Leu 



142/228 



WO 01/62927 



PCTYUS01/06059 



95 100 105 

Leu Gin Asp lie Ser Ala Glu Arg Ser Pro Asn Gly Val Gin Leu 

110 115 120 

Glu Arg Ser Asn Thr Ala Glu Lys Leu Tyr Asp Ser Asn His Ser 

125 130 135 

Gly Lys Val Phe Asn Glu His Pro Phe Leu Met Thr His Met lie 

140 145 150 

Thr His lie Gly Glu Lys Thr Ser Glu Asp Asn Gin Ser Gly 

155 160 

<210> 274 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI; 789657 . 1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 5 

<223> unknown or other 
<400> 274 



Met 


Trp 


Gin Val 


Xaa 


Ser Lys Ser Ser His 


Leu 


Ala Val His 


Gin 


1 




5 


10 






15 


Arg 


lie 


His Thr 


Gly 


Glu Lys Pro Tyr Lys 


Cys 


Asn Arg Cys 


Gly 








20 


25 






30 


Lys 


Cys 


Phe Ser 


Gin 


Ser Ser Ser Leu Ala 


Arg 


His Gin Thr 


Val 




35 


40 




45 


His 


Thr 


Gly Glu 


Lys 


Pro Tyr lie Cys Ala 


Glu 


Cys Gly Lys 


Ala 








50 


55 






60 


Phe 


Ser 


Gin Lys 


Ser 


Asp Leu Val Val His 


Gin 


lie He His 


Thr 








65 


70 






75 


Gly 


Glu 


Lys Pro 


Asp 


Arg Cys Thr Val Cys 


Gly 


Lys Ala Phe 


He 








80 


85 






90 


Gin 


Lys 


Ser Gin 


Leu 


Thr Val His Gin Arg 


lie 


His Thr Leu 


Met 








95 


100 






105 


Lys 


Ser 















<210> 275 
<211> 105 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :789808 . 1 .orf 3 :2000FEB01 
<400> 275 



Glu His 


Thr 


Asp 


Gly 


Lys 


Ser 


Tyr Ala 


Cys 


He Gin Cys Gly 


Lys 


1 






5 








10 




15 


Phe Phe 


Cys 


Cys 


Tyr 


Tyr 


Ser 


Phe Thr 


Glu 


His Leu Arg Arg 


His 








20 








25 




30 


Thr Gly 


Glu 


Lys 


Pro 


Phe 


Gly 


Cys Asn 


Glu 


Cys Gly Lys Thr 


Phe 








35 








40 




45 


His Gin 


Lys 


Leu 


Ala 


Leu 


He 


Val His 


Gin 


Arg Thr His He 


Arg 








50 








55 




60 


Gin Lys 


Pro 


Tyr 


Gly 


Cys 


Asn 


Glu Cys 


Gly 


Lys Ser Phe Cys 


Val 








65 








70 




75 


Lys Ser 


Lys 


Leu 


He 


Ala 


His 


His Arg 


Thr 


Tyr Thr Gly Glu 


Lys 








80 








85 


90 


Pro Tyr 


Glu 


Cys 


Asn 


Val 


Cys 


Gly Lys 


Leu 


Leu Leu Ser Gin 


Asn 








95 






100 




105 



143/228 



WO 01/62927 



PCT/US01/06059 



<210> 276 
<211> 149 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:792919.1.orfl:2000FEB01 
<400> 276 



His 


Gin 


Met He 


His 


Met 


Gly 


Gin Asn 


Pro 


Tyr 


Asn Cys Lys Glu 


1 






5 






10 




15 


Cys 


Gly 


Lys Ser 


Phe 


Lys 


Trp 


Ser Ser 


Tyr 


Leu 


Leu Val His Gin 








20 








25 




30 


Arg 


Val 


His Thr 


Gly 


Glu 


Lys 


Pro Tyr 


Lys 


Cys 


Glu Glu Cys Gly 








35 








40 




45 


Lys 


Gly 


Tyr He 


Ser 


Lys 


Ser 


Gly Leu 


Asp 


Phe 


His His Arg Thr 








50 








55 




60 


His 


Thr 


Gly Glu 


Arg 


Ser 


Tyr 


Asn Cys 


Asp 


Asn 


Cys Gly Lys Ser 








65 








70 




75 


Phe 


Arg 


His Ala 


Ser 


Ser 


He 


Leu Asn 


His 


Lys 


Lys Leu His Cys 








80 








85 




90 


Gin 


Arg 


Lys Pro 


Leu 


Lys 


Cys 


Glu Asp 


Cys 


Gly 


Lys Arg Leu Val 








95 








100 




105 


cys 


Arg 


Ser Tyr 


Cys 


Lys 


Asp 


Gin Gin 


Arg 


Asp 


His ser Gly Glu 








110 








115 




120 


Asn 


Pro 


Ser Lys 


Cys 


Glu 


Asp 


Cys Gly 


Lys 


Arg 


Tyr Lys Arg Arg 








125 








130 




135 


Leu 


Asn 


Leu Asp 


He 


He 


Leu 


Ser Leu 


Phe 


Leu 


Asn Asp He 






140 








145 







<210> 277 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : LI : 793949 . 1 .orf 3 :2000FEB01 
<400> 277 



Asp Thr 


Gly Thr 


Ser 


Trp 


Lys 


Pro Lys 


Met 


Gly 


Pro Leu Gin Phe 


1 






5 








10 




15 


Arg Asp 


Val 


Ala 


He 


Asp 


Phe 


Ser Gin 


GlU 


Glu 


Trp His Cys Leu 








20 








25 




30 


Asp Thr 


Ala 


Gin 


Arg 


Asp 


Leu 


Tyr Arg 


Cys 


Val 


Met Leu Glu Asn 








35 








40 




45 


Tyr Ser 


Asn 


Leu 


Val 


Phe 


Leu 


Gly He 


Thr 


Val 


Ser Lys Pro Asp 








50 








55 




60 


Val He 


Ser 


Ser 


Leu 


Glu 


Gin 


Gly Arg 


Lys 


Pro 


Leu Thr Met Lys 








65 








70 




75 


Arg Asn 


Glu 


Met 


He 


Ala 


Lys 


Pro Ser 


Val 


Ser 


Phe Leu Gin Val 








80 






85 




90 


His Ser 


GlU 


Ser 


Gin 


Ser 


Pro 


Leu His 


Asp 


He 










95 
t 








100 







<210> 278 
<211> 137 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI:794389 . l.orf3 :2000FEB01 
<220> 

<221> unsure 



144/228 



WO 01/62927 



PCT/US01/06059 



<222> 23 

<223> unknown or other 
<400> 278 



Gly 


Leu 


Gin 


Lys 


Thr 


Phe 


Cys 


Arg 


Val 


Met 


Gin 


Phe Thr Leu 


His 


1 








5 










10 






15 


Arg 


Arg 


lie 


His 


Thr 


Gly 


Glu 


Xaa 


Pro 


Tyr 


Glu 


Cys Lys Glu 


Cys 










20 










25 






30 


Gly 


Lys 


Ser 


Phe 


Ser 


Ala 


His 


Ser 


Ser 


Leu 


Val 


Thr His Lys 


Arg 






35 










40 






45 


Thr 


His 


Ser 


Gly Glu 


Lys 


Pro 


Tyr 


Lys 


Cys 


Lys 


Glu Cys Gly 


Lys 










50 










55 






60 


Ala 


Phe 


Ser 


Ala 


His 


Ser 


Ser 


Leu 


Val 


Thr 


His 


Lys Arg Thr 


His 










65 










70 




Gly Lys Ala 


75 


Ser 


Gly 


Glu 


Lys 


Pro 


Tyr 


Thr 


Cys 


His 


Ala 


Cys 


Phe 










80 










85 






90 


Asn 


Thr 


Ser 


Ser 


Thr 


Leu 


Cys 


Gin 


His 


Asn 


Arg 


He His Thr 


Gly 










95 








100 






105 


Glu 


Lys 


Pro 


Phe 


Gin 


Cys 


Ser 


Gin 


Cys 


Gly 


Lys 


Ser Phe Ser 


cys 










110 










115 






120 


Ser 


Ser 


His 


Leu 


Thr 


Arg 


His 


Cys 


Arg 


Met 


Cys 


Asn Gly Lys 


Phe 



125 130 135 

Ser Lys 



<210> 279 
<211> 97 . 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI-.796010. 1 .orf 3 :2000PEB01 
<220> 

<221> unsure 

<222> 4, 18, 23 

<223> unknown or other 

<400> 279 



Leu Cys 


lie 


Xaa Lys 


His 


Thr 


Gly 


GlU 


Lys 


Pro 


Tyr Glu Cys 


Tyr 


1 




5 










10 






15 


Ala Cys 


Xaa 


Asn Thr 


Phe 


Leu 


Xaa 


Lys 


Ser 


Asp 


Leu He Lys 


His 




20 










25 






30 


Gin Arg 


He 


His Thr 


Gly 


Glu 


Lys 


Pro 


Tyr 


Glu 


Cys Asn Glu 


cys 




35 






40 






45 


Gly Lys 


Ser 


Phe Ser 


Glu 


Lys 


Ser 


Thr 


Leu 


Thr 


Lys His Leu 


Arg 




50 










55 






60 


Thr His 


Arg 


Trp Glu 
65 


He 


Leu 


Cys 


Met 


Tyr 
70 


Ser 


Met Trp Lys 


He 
75 


Phe Leu 


Leu 


Leu Leu 
80 


Gin 


Phe 


His 


Arg 


Thr 
85 


Ser 


Glu Lys Thr 


His 
90 


Arg Gly 


Glu 


Thr Phe 
95 


Trp 


Met 















<210> 280 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:796324.1.orf2:2000FEB01 
<220> 

<221> unsure 
<222> 93 



145/228 



WO 01/62927 
<223> unknown or other 
<400> 280 



Leu Cys 


lie Arg 


Lys 


His Thr Gly Glu 


Lys 


Pro 


Tyr Glu Cys 


Tyr 


1 




5 




10 






15 


Ala Cys 


Gly Asn 


Thr 


Phe Leu Arg Lys 


Ser 


Asp 


Leu lie Lys 


His 






20 




25 






30 


Gin Arg 


lie His 


Thr 


Gly Glu Lys Pro 


Tyr 


Glu 


Cys Asn Glu 


Cys 






35 




40 






45 


Gly Lys 


Ser Phe 


Ser 


Glu Lys Ser Thr 


Leu 


Thr 


Lys His Leu 


Arg 






50 




55 






60 


Thr His 


Arg Trp 


Glu 


lie Leu Cys Met 


Tyr 


Ser 


Met Trp Lys 


He 






65 




70 






75 


Phe Leu 


Leu Leu 


Leu 


Gin Phe His Arg 


Thr 


Ser 


Glu Lys Thr 


His 






80 




85 






90 


Arg Gly 


Xaa Thr 


Phe 


Trp Met 














95 













<210> 281 
<211> 179 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 79 6373 . 1 .orf 1 :2000FEB01 



<400> 281 




Met 


Trp 


Glu 


Ser Phe 


1 






5 


Arg 


Phe 


His 


Thr Gly 
20 


Lys 


Ser 


Cys 


Ser His 
35 


His 


Thr 


Gly 


Glu Lys 
50 


Phe 


Arg 


Asp 


Lys Ser 
65 


Gly 


Glu 


Arg 


Pro Tyr 
80 


His 


Leu 


Ser 


Cys Leu 
95 


Glu 


Lys 


Cys 


Val Gly 
110 


Ser 


His 


Ser 


Leu . Ser 
125 


Ser 


Val 


Asn 


Met Val 
140 


Thr 


Ser 


Leu 


Thr Asn 
155 


He 


Val 


Ser 


Gin Pro 
170 



Ser 


Gin 


Lys Thr 


Cys 








10 


Lys 


Thr 


Pro Phe 


Val 






25 


Lys 


Ser 


Gly Leu 


He 








40 


Pro 


Tyr 


Thr Cys 


Ser 








55 


Cys 


Leu 


Asn Arg 


His 








70 


Gly 


Cys 


Ser Asp 


Cys 








85 


Val 


Tyr 


His Lys 


Gly 








100 


Ser 


Val 


Lys Leu 


Glu 






115 


His 


Thr 


Arg Asp 


Leu 








130 


Thr 


Leu 


Gin Met 


Pro 








145 


Ser 


Ala 


Phe Gin 


Ala 








160 


Val 


Ala 


Arg Ser 


Ser 






175 



Leu 


He Ser His 


Gin 






15 


Cys 


Thr Glu Cys 


Gly 






30 


Asn 


His Gin Arg 


He 






45 


Asp 


Cys Gly Lys 


Ala 






60 


Arg 


Arg Thr His 


Thr 






75 


Gly 


Lys Ala Phe 


Ser 






90 


Met 


Leu His Ala 


Arg 






105 


Asn 


Pro Cys Ser 


Glu 






120 


He 


Gin Asp Lys 


Asp 






135 


Ser 


Val Ala Ala 


Gin 






150 


Glu 


Ser Lys Val 


Ala 




165 


Val 


Ser Ala Asp 





<210> 282 

<211> 87 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI:796415.1.orf3:2000FEB01 



<400> 282 

Lys His Glu He He His Phe Glu Glu Glu Pro Ser Glu Tyr Asn 
1 5 10 15 

Asn Asn Gly Asn Ser Phe Trp Leu Asn Glu Asp Leu He Trp His 
20 25 30 



146/228 



WO 01/62927 



PCT/US01/06059 



Gin Lys lie Lys Asn Trp Glu Gin Pro Phe Glu Tyr Asn Glu Cys 

35 40 45 

Gly Lys Ala Phe Pro Glu Asn Ser Leu Phe Leu Val His Lys Arg 

50 55 60 

Ala Tyr Thr Gly Gin Lys Thr Cys Lys Tyr Thr Glu His Gly Lys 

65 70 75 

Thr Cys Tyr Met Ser Phe Phe lie Thr His Gin Gin 

80 85 

<210> 283 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:798636.1.orf2:2000FEB01 



<400> 283 










Asn 


Glu 


Cys 


Gly Lys 


Ala Leu Ser Ser 


His 


Ser Thr Leu He He 


1 




5 




10 


15 


His 


Glu 


Arg 


lie His 


Thr Gly Glu Lys 


Pro 


Cys Lys Cys Lys Val 








20 




25 


30 


Cys 


Gly 


Lys 


Ala Phe 


Arg Gin Ser Ser 


Ala 


Leu lie Gin His Gin 








35 




40 


45 


Arg 


Met 


His 


Thr Gly 


Glu Arg Pro Tyr 


Lys 


Cys Asn Glu Cys Asp 








50 




55 


60 


Lys 


Thr 


Phe 


Arg Cys 


Asn Ser Ser Leu 


Ser 


Asn His Gin Arg He 






65 




70 


75 


His 


Thr 


Gly 


Glu Lys 


Pro Tyr Arg Cys 


Leu 


Glu Cys Gly Met Ser 








80 




85 


. 90 


Phe 


Gly 


Gin 


Ser Ala 


Ala Leu lie Gin 


His 


Gin Arg He His Thr- 






95 




100 


105 


Gly 


Glu 


i*ys 


Pro Phe 


Lys Cys Asn Thr 


Cys 


Gly Lys Thr Phe Arg 








110 




115 


120 


Gin 


Ser 


Ser 


Ser Leu 


lie Ala His Gin 


Arg 


He His Thr Gly Glu 








125 




130 


135 


Lys 


Pro 


Tyr 


Glu Cys 


Asn Ala Cys Gly 


Lys 


Leu Phe Ser Gin Arg 








140 




145 


150 


Ser 


Ser 


Leu 


Thr Asn 


His Tyr Lys He 


His 


He Glu Glu Asp Ser 








155 


160 


165 


Leu 


Lys 


Ala 


Asp Leu 


His Val 












170 









<210> 284 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 800045 . 1 .orfl : 2000FEB01 
<220> 

<221> unsure 
<222> 104 

<223> unknown or other » 



<400> 284 

Lys He Met His Thr Gly Glu Lys Arg Tyr Glu cys Asp Asp Cys 
15 10 15 

Gly Gly Thr Phe Arg Ser Ser Ser Ser Leu Arg Val His Lys Arg 

20 25 30 

He His Thr Gly Glu Lys Pro Tyr Lys Cys Glu Glu cys Gly Lys 

35 40 45 

Ala Tyr Met Ser Tyr Ser Ser Leu He Asn His Lys Ser Thr His 

50 55 60 



147/228 



WO 01/62927 



PCT/US01/06059 



Ser Gly 


Glu Lys 


Asn 


Cys 


Lys Cys Asp 


Glu 


Cys 


Gly Lys Ser Phe 






65 






70 




75 


Asn Tyr 


Ser Ser 


Val 


Leu 


Asp Gin His 


Lys 


Arg 


He His Thr Gly 






80 






85 




90 


Glu Lys 


Pro Tyr 


Glu 


Cys 


Gly Glu Cys 


Gly 


Lys 


Ala Phe Xaa Asn 






95 






100 




105 


Ser Ser 


Gly Leu 


Arg 


Val 


His Lys Arg 


He 


His 


Thr Gly Glu Lys 






110 






115 




120 


Pro Tyr 


Glu Cys 


Asp 


lie 


Cys Gly Lys 


Thr 


Phe 


Ser Asn Ser Ser 






125 






130 




135 


Gly Leu 


Thr Val 


His 


Lys 


Arg lie His 


Thr 


Val 


Ser Asp Glu Leu 






140 






145 




150 



Pro 



<210> 285 

<211> 89 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 800680. 1 .orf 2 :2000FEB01 
<4CC> 285 



Leu Thr Tyr Leu 


Arg 


Lys Lys Leu Arg Gly 


Arg 


Gly Lys Lys Glu 


1 


5 


10 




15 


Glu Glu Gly Met 


Ala 


Leu Ser Gin Gly Leu 


Phe 


Thr Phe Lys Asp 




20 


25 




30 


Val Ala He Glu 


Phe 


Ser Gin Glu Glu Trp 


Glu 


Cys Leu Asp Pro 




35 


40 




45 


Ala Gin Arg Ala 


Leu 


Tyr Arg Asp Val Met 


Leu 


Glu Asn Tyr Arg 




50 


55 




60 


Asn Leu Leu Ser 


Leu 


Asp Glu Asp Asn He 


Pro 


Pro Glu Asp Gly 




65 


70 




75 


Ser His Leu Ala 


Ala 


Cys Gly Gin Ser Thr 


Leu 


Pro Leu Pro 




80 


85 







<210> 286 
<211> 146 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 800894 . 1 .orf 2 :2000FEB01 
<400> 286 



Pro 


Ala 


Gly 


He 


Gly 


Arg 


Ser 


Thr Thr 


Lys 


Ser 


Pro Gly Pro Pro 


1 








5 








10 




15 


Gly 


Ser 


Leu 


Glu 


Met 


Gly 


Ser 


Leu Thr 


Phe 


Arg 


Asp Val Ala He 










20 








25 




30 


Glu 


Phe 


Ser 


Leu 


Glu 


Glu 


Trp 


Gin Cys 


Leu 


Asp 


Thr Ala Gin Gin 










35 




40 


45 


Asn 


Leu 


Tyr 


Arg 


Asn 


Val 


Met 


Leu Glu 


Asn 


Tyr 


Arg Asn Leu Val 








50 








55 


60 


Phe 


Leu 


Gly 


He 


Ala 


Ala 


Phe 


Lys Pro 


Asp 


Leu 


He He Phe Leu 








65 








70 




75 


Glu 


Glu 


Gly 


Lys 


Glu 


Ser 


Trp 


Asn Met 


Lys 


Arg 


His Glu Met Val 










80 








85 




90 


Glu 


Glu 


Ser 


Pro 


Val 


He 


Cys 


Ser His 


Phe 


Ala 


Gin Asp Leu Trp 










95 








100 




105 


Pro 


Glu 


Gin 


Gly 


He 


Glu 


Asp 


Ser Phe 


Gin 


Lys 


Val He Leu Arg 










110 








115 




120 


Arg 


Tyr 


Lys 


He 


His 


His 


His 


Ala Cys 


Glu 


Leu 


Gly Pro He Met 










125 








130 




135 


Asn 


His 


Tyr 


Pro 


Thr 


Cys 


Gly 


Gin Met 


His 


He 





148/228 



WO 01/62927 



PC17US01/06059 



140 145 

<210> 287 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:801015. l.orf 1:2000FEB01 
<400> 287 

Gly Ser Arg Lys Met Asp Ser Val Ala Phe Glu Asp Val Ala Val 
15 10 15 

Asn Phe Thr Gin Glu Glu Trp Ala Leu Leu Asp Pro Trp Gin Lys 

20 25 30 

Lys Leu Tyr Arg Asp Val Met Leu Glu Thr Tyr Arg Asn Leu Ala 

35 40 45 

Ser Val Gly Asp Asp Asp Asn lie Pro Ser Leu Arg Glu Gin Val 

50 55 60 

Ala His Gin Arg Tyr Phe Lys Thr Trp His Val Glu Arg Glu Tyr 

65 70 75 

Phe Ser Lys 



<210> 288 
<211> 126 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI :801236 . 1 .orf 3 :2000FEB01 
<220> 

<221> unsure 
<222> 4 

<223> unknown or other 
<400> 288 



Met 


Trp 


Glu 


Xaa Phe 


Ser 


His Thr Pro 


Ala 


Phe 


He Gin His Gin 


1 




5 






10 




15 


Arg 


lie 


His 


Thr Gly 


Glu 


Lys Pro Tyr 


Glu 


Cys 


Asn Ala Cys Gly 








20 






25 




30 


Lys 


Ala 


Phe 


Asn Arg 


Ser 


Ala His Leu 


Thr 


Glu 


His Gin Arg Thr 






35 






40 




45 


His 


Thr 


Gly 


Glu Lys 


Pro 


Tyr Val Cys 


Lys 


Glu 


Cys Gly Lys Thr 








50 






55 




60 


Phe 


Ser 


Arg 


Ser Thr 


His 


Leu Thr Glu 


His 


Leu 


Lys He His Ser 








65 






70 




75 


Cys 


Val 


Lys 


Pro Tyr 


Gin 


Cys Asn Glu 


Cys 


Gin 


Lys Leu Phe Cys 








80 






85 




90 


Tyr 


Arg 


Thr 


Ser Leu 


lie 


Arg His Gin 


Arg 


Thr 


His Thr Gly Glu 








95 




100 




105 


Lys 


Pro 


Tyr 


Gin Cys 


Asn 


Glu Cys Gly 


Lys 


Ser 


Phe Ser Leu Ser 








110 






115 




120 


Ser 


Ala 


Leu 


Thr Lys 


His 
















125 













<210> 289 

<211> 96 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:803335 . l.orfl:2000FEB01 



149/228 



WO 01/62927 



PCT/US01/06059 



<400> 289 



Ser Ala 


Ala Met 


Phe 


Pro 


Val 


Phe Ser 


Gly 


Cys 


Phe Gin Glu 


Leu 


1 




5 








10 






15 


Gin Glu 


Lys Asn 


Lys 


Ser 


Leu 


Glu Leu 


Val 


Ser 


Phe Glu Glu 


Val 




20 








25 






30 


Ala Val 


His Phe 


Thr 


Trp 


Glu 


Glu Trp 


Gin 


Asp 


Leu Asp Asp 


Ala 






35 






40 






45 


Gin Arg 


Thr Leu 


Tyr 
50 


Arg 


Asp 


Val Met 


Leu 
55 


Glu 


Thr Tyr Ser 


Ser 
60 


Leu Val 


Ser Leu 


Gly 


His 


Cys 


lie Thr 


Lys 


Pro 


Glu Met He 


Phe 






65 






70 






75 


Lys Leu 


Glu Gin 


Gly 


Ala 


Glu 


Pro Trp 


lie 


Val 


Glu Glu Thr 


Leu 




80 








85 






90 


Asn Leu 


Arg Leu 


Ser 
95 


Gly 















<210> 290 
<211> 149 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:803998.1.orfl:2000FEB01 
<400> 290 



Lys 


Asn 


Ser 


Tyr Trp 


Arg 


Lys 


Asn 


Pro 


Thr 


Asn 


Met 


Lys Asn 


Val 


1 






5 










10 








15 


Ala 


Lys 


Leu 


Leu He 


Asn 


Ser 


Gin 


Arg 


Leu 


Leu 


Asn 


He Arg 


Glu 






20 










25 








30 


Phe 


Val 


Gin 


Glu Gly 


Asn 


Pro 


Thr 


Asn 


Leu 


Lys 


Asn 


Val Ala 


Ser 








35 










40 






45 


Leu 


Leu 


Ala 


He Pro 


Gin 


Ser 


Leu 


Leu 


Asn 


He 


His 


Val He 


His 








50 










55 








60 


Thr 


Gly 


Gly 


Asn Ser 


Tyr 


Asn 


Cys 


Val 


Glu 


Cys 


Cys 


Asn Ala 


Leu 








65 










70 








75 


Asn 


Gin 


Ser 


Leu Arg 


Leu 


Thr 


Thr 


Tyr 


Lys 


Thr 


Thr 


His Thr 


Gly 








80 










85 








90 


Glu 


Lys 


Pro 


Cys Met 


Cys 


Glu 


Glu 


Cys 


Gly 


Lys 


Ala 


Ser Asn 


Arg 








95 










100 








105 


Ser 


Ser 


He 


Leu Lys 


Arg 


His 


Lys 


Leu 


He 


His 


Thr 


Gin Glu 


Arg 








110 










115 








120 


Leu 


Tyr 


Lys 


Pro Glu 


Arg 


Cys 


Asp 


Asn 


Ala 


Phe 


Gly Asn Thr 


Ser 








125 










130 








135 


Asp 


Phe 


Ser 


Glu Tyr 


Lys 


Arg 


Asn 


Arg 


Thr 


Asp 


Glu Lys Ser 





140 145 

<210> 291 
<211> 134 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 



<223> Incyte ID 


No: 


LI: 478757.1. 


,orf2:2000FEB01 




<400> 291 












Trp Trp Glu He 


Cys 


Ala His Ser 


Asp Val Ala 


Ala Glu Glu 


Gly 


1 


5 




10 




15 


Lys Ala Arg Arg 


Ser 


Arg Gin His 


Arg Phe Leu 


Gly Thr Cys 


Glu 




20 




25 




30 


Gly He Met Arg 


Arg 


Ala Glu Leu 


Ser Ser Gin 


Val Glu Asp 


Ser 


35 




40 




45 


Thr Leu His Ala 


Trp 


He Arg Tyr 


Ser Leu Val 


Leu Asp Val 


Asp 




50 


55 




60 


Cys Trp His He 


Ala 


Ala Gin Leu 


Glu Met Tyr 


Gly Cys Pro 


His 


65 




70 




75 
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Leu Asp Leu Thr Glu 
80 

Leu Leu Gly Phe Ser 
95 

Ser Gin Gly Ser Val 
110 

Gin Glu Glu Trp His 
125 

<210> 292 
<211> 212 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: LI: 808532 . 1 ,orf2:2000FEB01 

<400> 292 



His 


Asn 


Phe 


Gin Leu 


Gin Lys His His 


Arg 


He 


His Thr Gly Glu 


1 






5 




10 




15 


Lys 


Pro 


Phe 


Lys Cys 


Glu lie Cys Gly 


Lys 


Ser 


Phe Cys Leu Arg 








20 




25 




30 


Ser 


Ser 


Leu 


Asn Arg 


His Cys Met Val 


His 


TOr 


Ala Glu Lys Leu 








35 


40 




45 


Tyr 


Lys 


Ser 


Glu Lys 


Tyr Gly Arg Gly 


Phe 


He 


Asp Arg Leu Asp 








50 




55 




60 


Leu 


His 


Lys 


His Gin 


Met lie His Met 


Gly 


Gin 


Lys Pro Tyr Asn 






65 




70 




75 


Cys 


Lys 


Glu 


Cys Gly 


Lys Ser Phe Lys 


Trp 


Ser 


Ser Tyr Leu Leu 








80 




85 




90 


Val 


His 


Gin 


Arg Val 


His Thr Gly Glu 


Lys 


Pro 


Tyr Lys Cys Glu 








95 




100 




105 


Glu 


Cys 


Gly 


Lys Gly 


Tyr He Ser Lys 


Ser 


Gly 


Leu Asp Phe His 








110 




•115 




120 


His 


Arg 


Thr 


His Thr 


Gly Glu Arg Ser 


Tyr 


Asn 


Cys Asp Asn Cys 








125 




130 




135 


Gly 


Lys 


Ser 


Phe Arg 


His Ala Ser Ser 


He 


Leu 


Asn His Lys Lys 








140 




145 




150 


Leu 


His 


Cys 


Gin Arg 


Lys Pro Leu Lys 


Cys 


Glu 


Asp Cys Gly Lys 








155 




160 




165 


Arg 


Leu 


Val 


Cys Arg 


Ser Tyr Cys Lys 


Asp 


Gin 


Gin Arg Asp His 








170 




175 




180 


Ser 


Gly 


Glu 


Asn Pro 


Ser Lys Cys Glu 


Asp 


Cys 


Gly Lys Arg Tyr 








185 




190 




195 


Lys 


Arg 


Arg 


Leu Asn 


Leu Asp He He 


Leu 


Ser 


Leu Phe Leu Asn 






200 




205 




210 



Asp He 



Ser Arg Gly Ala Ala Ala Arg Lys Leu His ' 
85 90 
Ala Leu Pro Thr Leu Val Asp Met He Thr 
100 105 
Ser Phe Arg Asp Val Thr Met Gly Phe Thr 
115 120 
His Leu Asp Pro Ala Gin Arg Thr Leu 
130 



<210> 293 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 443 073 . 1 .orf 2 :2000FEB01 
<400> 293 

Lys Pro Tyr Met Cys Lys Glu Cys Arg Lys Thr Phe Ser Gin Asn 
15 10 15 

Ala Gly Leu Ala Gin His Gin Arg He His Thr Gly Glu Lys Pro 

20 25 30 

Tyr Glu Cys Asn Val Cys Gly Lys Ala Phe Ser Tyr Ser Gly Ser 

35 40 45 

Leu Thr Leu His Gin Arg He His Thr Gly Glu Arg Pro Tyr Glu 
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50 


55 


60 


Cys 


Lys Asp 


Cys 


Arg 


Lys Ser Phe Arg Gin Arg Ala His Leu 


Ala 








65 


70 


75 


His 


His Glu 


Arg 


He 


His Thr Met Glu Ser Phe Leu Thr Leu 


Ser 






80 


85 


90 


Ser 


Pro Ser 


Pro 


Ser 


Thr Ser Asn Gin Leu Pro Arg Pro Val 


Gly 








95 


100 


105 


Phe 


He Ser 











<210> 294 

<211> 83 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 479671 . 1 .orf 1 :2000FEB01 
<400> 294 



Pro Ala 


Cys 


Thr Gly 


Gly Phe Ala Gly 


Arg Met Ser Gly His 


Pro 


1 




5 




10 


15 


Gly Ser 


Trp 


Glu Met 


Asn Ser Val Ala 


Phe Glu Asp Val Ala 


Val 




20 




25 


30 


Asn Phe 


Thr 


Gin Glu 


Glu Trp Ala Leu 


Leu Asp Pro Ser Gin 


Lys 






35 


40 


45 


Asn Leu 


Tyr 


Arg Asp 


Val Met Gin Glu 


Thr Phe Arg Asn Leu 


Ala 






50 




55 


60 


Ser He 


Gly 


Asn Lys 


Gly Glu Asp Gin 


Ser He Glu Asp Gin 


Tyr 






65 




70 


75 


Lys Asn 


Ser 


Ser Arg 


Asn Leu Arg 










80 









<210> 295 
<211> 180 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:810078 . 1 .orf 1 :2000FEB01 
<400> 295 



Pro 


Tyr 


Val 


Cys 


Lys 


Glu 


Cys 


Gly Lys 


Ala 


Phe 


Thr Gin Tyr 


Ser 


1 








5 








10 






15 


Gly 


Leu 


Ser 


Met 


His 


Val 


Arg 


Ser His 


Ser 


Gly 


Asp Lys Pro 


Tyr 








20 








25 






30 


Glu 


Cys 


Lys 


Glu 


Cys 


Gly 


Lys 


Ser Phe 


Leu 


Thr 


Ser Ser Arg 


Leu 










35 






40 






45 


He 


Gin 


His 


He 


Arg 


Thr 


His 


Thr Gly 


Glu 


Lys 


Pro Phe Val 


Cys 










50 








55 




60 


Val 


Glu 


Cys 


Gly 


Lys 
65 


Ala 


Phe 


Ala Val 


Ser 
70 


Ser 


Asn Leu Ser 


Gly 
75 


His 


Leu 


Arg 


Thr 


His 
80 


Thr 


Glu 


Glu Lys 


Ala 
85 


Cys 


Glu Cys Lys 


He 
90 


Cys 


Gly 


Lys 


Val 


Phe 
95 


Gly 


Tyr 


Pro Ser 


Cys 
100 


Leu 


Asn Asn His 


Met 
105 


Arg 


Thr 


His 


Ser 


Ala 
110 


Gin 


Lys 


Pro Tyr 


Thr 
115 


Cys 


Lys Glu Cys 


Gly 
120 


Lys 


Ala 


Phe 


Asn 


Tyr 
125 


Ser 


Thr 


His Leu 


Lys 
13 0 


He 


His Met Arg 


He 
135 


His 


Thr 


Gly 


Glu 


Lys 
140 


Pro 


Tyr 


Glu Cys 


Lys 
145 


Gin 


Cys Gly Lys 


Ala 
150 


Phe 


Ser 


His 


Ser 


Ser 
155 


Ser 


Phe 


Gin He 


His 
160 


Glu 


Arg Thr His 


Thr 
165 


Gly 


Glu 


Lys 


Pro 


Tyr 
170 


Glu 


Cys 


Lys Glu 


cys 
175 


Gly 


Lys Ala Phe 


Thr 
180 
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<210> 296 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:810224. 1 ,orf3 :2000FEB01 
<400> 296 

Leu Cys lie Ser Lys His Pro Gly Glu Lys Pro Tyr Glu Cys Tyr 
15 10 15 

Ala Cys Gly Asn Thr Phe Leu Arg Lys Ser Asp Leu lie Lys His 

20 25 30 

Gin Arg lie His Thr Gly Glu Lys Pro Tyr Glu Cys Asn Glu Cys 

35 40 45 

Gly Lys Ser Phe Ser Glu Lys Ser Thr Leu Thr Lys His Leu Arg 

50 55 60 

Thr His Arg Trp Glu lie Leu Cys Met Tyr Ser Met Trp Lys He 

65 70 75 

Phe Leu Leu Leu Leu Gin Phe His Arg Thr Ser Glu Lys Pro His 

80 85 90 

Arg Gly Glu Thr Phe Trp Met 

95 

<210> 297 
<211> 217 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 817052 .2 .orf 1:2000FEB01 



<400> 297 
















Ala 


Ala 


Pro 


Glu 


Ser 


Gin 


Gin Gin Arg 


Asn 


Arg Arg Gly Glu 


Arg 


1 








5 






10 




15 


Pro 


Phe 


Thr 


Cys 


Met 


Glu 


Cys Gly Lys 


Ser 


Phe Arg Leu Lys 


He 










20 






25 




30 


Asn 


Leu 


He 


He 


His 


Gin 


Arg Asn His 


He 


Lys Glu Gly Pro 


Tyr 










35 






40 




45 


Glu 


Cys 


Ala 


Glu 


Cys 


Glu 


He Ser Phe 


Arg 


His Lys Gin Gin 


Leu 










50 






55 




60 


Thr 


Leu 


His 


Gin 


Arg 


He 


His Arg Val 


Arg 


Gly Gly Cys Val 


Ser 










65 






70 




75 


Pro 


Glu 


Arg 


Gly 


Pro 


Thr 


Phe Asn Pro 


Lys 


His Ala Leu Lys 


Pro 










80 






85 




90 


Arg 


Pro 


Lys 


Ser 


Pro 


Ser 


Ser Gly Ser 


Gly 


Gly Gly Gly Pro 


Lys 










95 






100 




105 


Pro 


Tyr 


Lys 


Cys 


Pro 


Glu 


Cys Asp Ser 


Ser 


Phe Ser His Lys 


Ser 










110 






115 




120 


Ser 


Leu 


Thr 


Lys 


His 


Gin 


He Thr His 


Thr 


Gly Glu Arg Pro 


Tyr 










125 






130 




135 


Thr 


cys 


Pro 


Glu 


Cys 


Lys 


Lys Ser Phe 


Arg 


Leu His He Ser 


Leu 










140 






145 




150 


Val 


He 


His 


Gin 


Arg 


Val 


His Ala Gly 


Lys 


His Glu Val Ser 


Phe 










155 




160 




165 


He 


Cys 


Ser 


Leu 


Cys" 


Gly 


Lys Ser Phe 


Ser 


Arg Pro Ser His 


Leu 










170 






175 




180 


Leu 


Arg 


His 


Gin 


Arg 


Thr 


His Thr Gly 


Glu 


Arg Pro Phe Lys 


Cys 










185 






190 




195 


Pro 


Glu 


Cys 


Glu 


Lys 


Ser 


Phe Ser Glu 


Lys 


Ser Lys Leu Thr 


Asn 










200 






205 


210 


His 


Cys 


Arg 


Val 


His 


Ser 


Arg 
















215 













153/228 



WO 01/62927 



PCT/US01/06059 



<210> 298 
<211> 137 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 892274 . 1 .orf 3 :2000MAY19 
<400> 298 



Asp 


Gly 


Gly 


Leu Asp 


Leu Gly 


Pro Thr Asn 


Ser 


Glu Gly He 


Pro 


1 








5 




10 






15 


Ser 


Pro 


Asp 


Leu 


Asn 


Pro Val 


Leu Gly Met 


Gly 


Ser Trp Arg 


His 










20 




25 






30 


lie 


Asp 


Ser 


lie 


Thr 


Pro Gly 


Thr Pro Gly 


Ser 


Ala Gly Leu 


Asp 










35 




40 






45 


Leu 


Pro 


Ala 


Arg 


Glu 


Arg He 


Thr Leu Val 


Gly 


Gly Asp Lys 


Pro 










50 




55 






60 


He 


Lys 


Val 


Pro 


Thr 


Gly He 


Trp Gly Thr 


Ser 


Pro Ala Gly 


Tyr 










65 




70 






75 


Met 


Gly 


Leu 


He 


Leu 


Gly Lys 


Ser Arg Leu 


Asn 


Leu Gin Gly 


Met 










80 




85 






90 


Thr 


Val 


Val 


Pro 


Gly 


Ala Val 


Asp Ser Asp 


Tyr 


Glu Gly Glu 


Thr 










95 




100 






105 


Gin 


Val 


Val 


Leu 


Met 


Ser Gin 


Asp Leu Trp 


Val 


Phe Glu Leu 


Gly 










110 




115 






120 


Glu 


Tyr 


He 


Ala 


Gin 


Leu Leu 


Leu He Pro 


Cys 


Lys Leu His 


Pro 










125 




130 


135 



Ser Pro 



<210> 299 
<211> 169 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1080959 . 1 .orf 2 :2000MAY19 
<400> 299 



Pro 


Lys 


Gin 


Gly 


He 


Asn Val 


Trp Ser 


Pro 


Arg His Pro Glu 


Asn 


1 






5 






10 




15 


Phe 


Leu 


Gly 


He 


Glu 


Ser Arg 


Pro Pro 


Met 


Leu Ser Leu Ser 


Pro 










20 






25 




30 


He 


Leu 


Leu 


Tyr 


Thr 


Cys Glu 


Met Phe 


Gin 


Asp Pro Val Ala 


Phe 










35 






40 




45 


Lys 


Asp 


Val 


Ala 


Val 


Asn Phe 


Thr Gin 


Glu 


Glu Trp Ala Leu 


Leu 








50 






55 


60 


Asp 


He 


Ser 


Gin 


Lys 


Asn Leu 


Tyr Arg 


Glu 


Val Met Leu Glu 


Thr 








65 






70 




75 


Phe 


Trp 


Asn 


Leu 


Thr 


Ser He 


Gly Lys 


Lys 


Trp Lys Asp Gin 


Asn 










80 






85 




90 


He 


Glu 


Tyr 


Glu 


Tyr 


Gin Asn 


Pro Arg 


Arg 


Asn Phe Arg Ser 


Val 










95 






100 




105 


Thr 


Glu 


Glu 


Lys 


Val 


Asn Glu 


He Lys 


Glu 


Asp Ser His Cys 


Gly 










110 






115 




120 


Glu 


Thr 


Phe 


Thr 


Pro 


Val Pro 


Asp Asp 


Arg 


Leu Asn Phe Gin 


Lys 










125 




130 




135 


Lys 


Lys 


Ala 


Ser 


Pro 


Glu Val 


Lys Ser 


Cys 


Asp Ser Phe Val 


Cys 










140 






145 




150 


Glu 


Val 


Gly 


Leu 


Gly 


Asn Ser 


Ser Ser 


Asn 


Met Asn He Arg 


Gly 










155 






160 


165 


Asp 


Thr 


Gly 


His 















<210> 300 



154/228 



WO 01/62927 



PCTYUS01/06059 



<211> 135 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID NO: LG: 1054900 . 1 . orf 3 : 2000MAY19 
<400> 300 



Asp 


Ala 


Trp 


Ala Arg 


Pro 


Pro 


Val 


Leu 


Ser 


Leu 


Ser Pro He Leu 


1 






5 










10 




15 


Leu 


Tyr 


Thr 


Cys 


Glu 


Met 


Phe 


Gin 


Asp 


Pro 


Val 


Ala Phe Asp Asp 








20 










25 




30 


Val 


Ala 


Val 


Asn 


Phe 


Thr 


Gin 


Glu 


Glu 


Trp 


Ala 


Leu Leu Asp He 










35 










40 




45 


Ser 


Gin 


Arg 


Lys 


Leu 


Tyr 


Lys 


Glu 


Val 


'Met 


Leu 


Glu Thr Phe Arg 








50 










55 




60 


Asn 


Leu 


Thr 


Ser 


Val 


Gly 


Lys 


Ser 


Trp 


Lys 


Asp 


Gin Asn He Glu 










65 








70 




75 


Tyr 


Glu 


Tyr 


Gin Asn 


Pro 


Arg 


Arg 


Asn 


Phe 


Arg 


Ser Leu He Glu 








80 










85 




90 


Lys 


Lys 


Val 


Asn 


Glu 


He 


Lys 


Asp 


Asp 


Ser 


His 


Cys Gly Glu Thr 










95 










100 




105 


Phe 


Thr 


Gin 


Val 


Pro 


Asp 


Asp 


Arg 


Leu 


Asn 


Phe 


Gin Glu Lys Lys 










110 








115 




120 


Ala 


Ser 


Pro 


Glu 


He 


Lys 


Ser 


Cys 


Asp 


Ser 


Phe 


Val Cys Gly Lys 



125 130 135 



<210> 301 
<211> 170 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1077357 . 1 . orf 1 : 2000MAY19 
<400> 301 



Thr 


Val 


Met 


Leu Cys 


Asp 


Glu Glu Ala 


Gin 


Lys Arg Lys Ala Lys 


1 






5 






10 


15 


Glu 


Ser 


Gly 


Met Ala 


Leu 


Pro Gin Gly 


Arg 


Leu Thr Phe Met Asp 






20 






25 


30 


val 


Ala 


He 


Glu Phe 


Ser 


Gin Glu Glu 


Trp 


Lys Ser Leu Asp Pro 








35 






40 


45 


Gly 


Gin 


Arg 


Ala Leu 


Tyr 


Arg Asp Val 


Met 


Leu Glu Asn Tyr Arg 








50 






55 


60 


Asn 


Leu 


Val 


Phe Leu 


Gly 


He Cys Leu 


Pro 


Asp Leu Ser He He 








65 


70 


75 


Ser 


Met 


Leu 


Lys Gin 


Arg 


Arg Glu Pro 


Leu 


He Leu Gin Ser Gin 








80 




85 


90 


Val 


Lys 


He 


Val Lys 


Asn 


Thr Asp Gly 


Arg 


Glu Cys Val Arg Ser 








95 






100 


105 


Val 


Asn 


Thr 


Gly Arg 


Ser 


Cys Val Leu 


Gly 


Ser Asn Ala Glu Asn 








110 




115 


120 


Lys 


Pro 


He 


Lys Asn 


Gin 


Leu Gly Leu 


Thr 


Leu Glu Ser His Leu 






125 






130 


135 


Ser 


Glu 


Leu 


Gin Leu 


Phe 


Gin* Ala Gly 


Arg 


Lys He Tyr Arg Ser 








140 






145 


150 


Asn 


Pro 


Val 


Glu Lys 


Phe 


Thr Asn His 


Arg 


Ser Ser Val Ser Pro 








155 






160 


165 


Leu 


Gin 


Lys 


He Ser 














170 











<210> 302 
<211> 181 
<212> PRT 



155/228 



WO 01/62927 



PCTAJS01/06059 



<213> Homo sapi ns 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1084051 . l.orf 3 :2000MAY19 
<400> 302 



Thr 


Ser 


Tyr 


He Arg 


Thr 


Lys Thr Tyr Glu 


Cys 


Asn 


He Cys 


Glu 


1 






5 




10 








15 


Lys 


lie 


Phe 


Lys Gin 


Pro 


He His Leu Thr 


GlU 


His 


Met Arg 


He 






20 




25 






30 


His 


Thr 


Gly 


Glu Lys 


Pro 


Phe Arg Cys Lys 


Glu 


Cys 


Gly Arg 


Ala 








35 




40 








45 


Phe 


Ser 


Gin 


Ser Ala 


Ser 


Leu Ser Thr His 


Gin 


Arg 


He His 


Thr 








50 




55 








60 


Gly 


Glu 


Lys 


Pro Phe 


Glu 


Cys Glu Glu Cys 


Gly 


Lys 


Ala Phe 


Arg 








65 




70 








75 


His 


Arg 


Ser 


Ser Leu 


Asn 


Gin His His Arg 


Thr 


His 


Thr Gly 


Glu 








80 




85 








90 


Lys 


Pro 


Tyr 


Val Cys 


Asp 


Lys Cys Gin Lys 


Ala 


Phe 


Ser Gin 


Asn 








95 




100 








105 


He 


Ser 


Leu 


Val Gin 


His 


Leu Arg Thr His 


Ser 


Gly 


Glu Lys 


Pro 








110 




115 




120 


Phe 


Thr 


Cys 


Asn Glu 


Cys 


Gly Lys Thr Phe 


Arg 


Gin 


He Arg 


His 








125 




130 








135 


Leu 


Ser 


Glu 


His He 


Arg 


lie His Thr Gly 


Glu 


Lys 


Pro Tyr 


Ala 








140 




145 








150 


Cys 


Thr 


Ala 


Cys Cys 


Lys 


Thr Phe Ser His 


Arg 


Ala 


Tyr Leu 


Thr 








155 




160 








165 


His 


His 


Gin 


Arg Ser 


He 


Leu Gly Arg Asp 


Leu 


Gin 


Cys Lys 


Glu 








170 




175 








180 



Cys 



<210> 303 
<211> 263 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076853 . 1 .orfl :2000MAY19 
<400> 303 



Ala 


Phe 


Ser 


Arg 


cys 


Ser 


Ser 


Leu 


Val 


Gin 


His 


Glu 


Arg Thr 


His 


1 








5 










10 








15 


Thr 


Gly 


Glu 


Lys 


Pro 


Phe 


Glu 


Cys 


Ser 


He 


Cys 


Gly 


Arg Ala 


Phe 










20 










25 








30 


Gly 


Gin 


Ser 


Pro 


Ser 


Leu 


Tyr 


Lys 


His 


Met 


Arg 


He 


His Lys 


Arg 










35 










40 








45 


Gly 


Lys 


Pro 


Tyr 


Gin 


Ser 


Ser 


Asn 


Tyr 


Ser 


He 


Asp 


Phe Lys 


His 










50 










55 








60 


Ser 


Thr 


Ser 


Leu 


Thr 


Gin 


Asp 


Glu 


Ser 


Thr 


Leu 


Thr 


Glu Val 


Lys 










65 










70 








75 


Ser 


Tyr 


His 


Cys 


Asn 


Asp 


Cys 


Gly 


Glu 


Asp 


Phe 


Ser 


His He 


Thr 










80 










85 








90 


Asp 


Phe 


Thr 


Asp 


His 


Gin 


Arg 


He 


His 


Thr 


Ala 


Glu 


Asn Pro 


Tyr 










95 










100 








105 


Asp 


Cys 


Glu 


Gin 


Ala 


Phe 


Ser 


Gin 


Gin 


Ala 


He 


Ser 


His Pro 


Gly 










110 










115 








120 


Glu 


Lys 


Pro 


Tyr 


Gin 


Cys 


Asn 


Val 


Cys 


Gly 


Lys 


Ala 


Phe Lys 


Arg 










125 










13 0 








135 


Ser 


Thr 


Ser 


Phe 


He 


Glu 


His 


His 


Arg 


He 


His 


Thr 


Gly Glu 


Lys 










140 








145 






150 


Pro 


Tyr 


Glu 


Cys 


Asn 


Glu 


Cys 


Gly 


Glu 


Ala 


Phe 


Ser 


Arg Arg 


Ser 










155 








160 








165 


Ser 


Leu 


Thr 


Gin 


His 


Glu 


Arg 


Thr 


His 


Thr 


Gly 


Glu 


Lys Pro 


Tyr 



156/228 



WO 01/62927 





170 






175 










180 


Glu cys lie Asp 


Cys 


Gly Lys Ala 


Phe 


Ser 


Gin 


Ser 


Ser 


Ser 


Leu 




185 




190 










195 


lie Gin His Glu 


Arg 
200 


Thr His Thr 


Gly 


Glu 
205 


Lys 


Pro 


Tyr 


Glu 


Cys 
210 


Asn Glu Cys Gly 


Arg 
215 


Ala Phe Arg 


Lys 


Lys 
220 


Thr 


Asn 


Leu 


His 


Asp 
225 


His Gin Arg lie 


His 
230 


Thr Gly Glu 


Lys 


Pro 
235 


Tyr 


Ser 


Cys 


Lys 


Glu 
240 


Cys Gly Lys Asn 


Phe 
245 


Ser Arg Ser 


Ser 


Ala 
250 


Leu 


Thr 


Lys 


His 


Gin 
255 


Arg lie His Thr 


Arg 
260 


Asn Lys Leu 

















<210> 304 
<211> 340 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG:481631 . 10 .orf 3 :2000MAY19 
<400> 304 



Arg 


Leu 


Leu 


val 


Pro 


Glu 


Glu 


Glu 


Thr 


Gin 


Lys 


Arg 


Lys 


Arg 


Lys 


1 








5 










10 










15 


Ala 


Lys 


Glu 


Ser 


Gly 


Met 


Ala 


Leu 


Ser 


Gin 


Gly 


Leu 


Leu 


Thr 


Phe 








20 










25 










30 


Arg. 


Asp 


Val 


Ala 


lie 


Glu 


Phe 


Ser 


Gin 


GlU 


Glu 


Trp 


Lys 


Cys 


Leu 










35 










40 










45 


Asp 


Pro 


Ala 


Gin 


Arg 


Thr 


Leu 


Tyr 


Arg 


Asp 


Val 


Met 


Leu 


Glu 


Asn 










50 










55 










60 


Tyr 


Arg 


Asn 


Leu 


Val 


Ser 


Leu 


Asp 


He 


Ser 


Ser 


Lys 


Cys 


Thr 


Met 










65 










70 










75 


Lys 


Glu 


Phe 


Leu 


Ser 


Thr 


Ala 


Gin 


Gly 


Asn 


Arg 


Glu 


Val 


Phe 


His 








80 










85 










90 


Ala 


Gly 


Thr 


Leu 


Gin 


He 


His 


Glu 


Ser 


His 


His 


Asn 


Gly 


Asp 


Phe 










95 










100 










105 


Cys 


Tyr 


Gin 


Asp 


Val 


Asp 


Lys 


Asp 


He 


His 


Asp 


Tyr 


Glu 


Phe 


Gin 










110 










115 










120 


Trp 


Gin 


Glu 


Asp 


Glu 


Arg 


Asn 


Gly 


His 


Glu 


Ala 


Pro 


Met 


Thr 


Lys 










125 










130 










135 


He 


Lys 


Lys 


Leu 


Thr 


Gly 


He 


Thr 


Glu 


Arg 


Tyr 


Asp 


Gin 


Ser 


His 










140 










145 










150 


Ala 


Arg 


Asn 


Lys 


Pro 


He 


Lys 


Asp 


Gin 


Leu 


Gly 


Ser 


Ser 


Phe 


His 










155 










160 










165 


Ser 


His 


Leu 


Pro 


Glu 


Met 


His 


He 


Phe 


Gin 


Thr 


Glu 


Glu 


Lys 


He 










170 










175 








180 


Asp 


Asn 


Gin 


Val 


Val 


Lys 


Ser 


lie 


His 


Asp 


Ala 


Ser 


Leu 


Val 


Ser 








185 








190 










195 


Thr 


Ala 


Gin 


Arg 


He 


Ser 


Cys 


Arg 


Pro 


Lys 


Thr 


His 


He 


Ser 


Asn 








200 










205 










210 


Asn 


His 


Gly 


Asn 


Asn 


Phe 


Trp 


Asn 


Ser 


Ser 


Leu 


Leu 


Thr 


Gin 


Lys 








215 










220 










225 


Gin 


Glu 


Val 


His 


Met 


Arg 


Glu 


Lys 


Ser 


Phe 


Gin 


Cys 


Asn 


Glu 


Ser 










230 










235 










240 


Gly 


Lys 


Ala 


Phe 


Asn 


Tyr 


Ser 


Ser 


Leu 


Leu 


Arg 


Lys 


His 


Gin 


He 










245 










250 










255 


He 


His 


Leu 


Ala 


Asp 


Lys 


Tyr 


Lys 


Cys 


Asp 


Val 


Cys 


Gly 


Lys 


Leu 










260 










265 










270 


Phe 


Asn 


Gin 


Lys 


Arg 


Asn 


Leu 


Ala 


Cys 


His 


Arg 


Arg 


Cys 


His 


Thr 










275 










280 










285 


Gly 


Glu 


Asn 


Pro 


Tyr 


Lys 


Cys 


Asn 


Glu 


Cys 


Gly 


Lys 


Thr 


Phe 


Ser 










290 










295 










300 


Gin 


Thr 


Ser 


Ser 


Leu 


Thr 


Cys 


His 


Arg 


Arg 


Leu 


His. 


Thr 


Gly 


Glu 










305 








310 










315 


Lys 


Pro 


Tyr 


Lys 


Cys 


Glu 


Glu 


Cys 


Asp 


Lys 


Ala 


Phe 


His 


Phe 


Lys 



157/228 



WO 01/62927 PCTYUS01/06059 

320 325 330 

Ser lie Leu Glu Arg His Arg lie lie His 
335 340 



<210> 305 
<211> 89 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1088431 .2 . orf 1 : 2000MAY19 
<400> 305 



Leu Thr Tyr 


Leu Arg 


Lys 


Lys Leu 


Arg 


Gly 


Arg 


Gly Lys Lys Glu 


1 


5 








10 




15 


Glu Glu Gly 


Met Ala 


Leu 


Ser Gin 


Gly 


Leu 


Phe 


Thr Phe Lys Asp 




20 








25 




30 


Val Ala He 


Glu Phe 


Ser 


Gin Glu 


Glu 


Trp 


Glu 


Cys Leu Asp Pro 




35 








40 




45 


Ala Gin Arg 


Ala Leu 


Tyr 


Arg Asp 


Val 


Met 


Leu 


Glu Asn Tyr Arg 




50 








55 




SO 


Asn Leu Leu 


Ser Leu 


Asp 


Glu Asp 


Asn 


He 


Pro 


Pro Glu Asp Gly 




65 








70 




75 


Ser His Leu 


Ala Ala 


cys 


Gly Gin 


Ser 


Thr 


Leu 


Pro Leu Pro 



80 85 



<210> 306 
<211> 80 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 401619 . 10 .orf 1 : 2000MAY01 



<400> 306 

Ala Leu His Pro Leu Pro Lys Arg Gin Ala Leu Glu Lys Ser Asn 

15 10 15 

Gly Thr Ser Ala Val Phe Asn Pro Ser Val Leu His Tyr Gin Gin 

20 25 30 

Ala Leu Thr Ser Ala Gin Leu Gin Gin His Ala Ala Phe He Pro 

35 40 45 

Thr Gly Met Cys Pro Tyr Cys Pro Thr Ser Cys Ala Leu Leu Val 

50 55 60 

Met Cys Phe Leu Leu He Ser Leu Ser Cys Leu Val Ala Ser Ser 

65 70 75 
Leu Leu Leu Lys Val 

80 



<210> 307 
<211> 386 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1144007 . 1 . orf 2 :2000MAY01 
<400> 307 

Ala Leu Asp Gly Val Arg Pro Pro Gin Ser Gin Glu Met Ala Thr 
15 10 15 

Ala Val Glu Pro Glu Asp Gin Asp Leu Trp Glu Glu Glu Gly He 

20 25 30 

Leu Met Val Lys Leu Glu Asp Asp Phe Thr Cys Arg Pro Glu Ser 

35 ' 40 45 

Val Leu Gin Arg Asp Asp Pro Val Leu Glu Thr Ser His Gin Asn 



158/228 



WO 01/62927 PCT/US01/06059 









50 






55 






60 


Phe 


Arg 


Arg Phe 


Arg 


Tyr 


Gin Glu Ala 


Ala 


Ser 


Pro Arg Glu 


Ala 








65 






70 






75 


Leu 


He 


Arg Leu 


Arg 


Glu 


Leu Cys His 


Gin 


Trp 


Leu Arg Pro 


GlU 








80 






85 






90 


Arg 


Arg 


Thr Lys 


Glu 


Gin 


He Leu Glu 


Leu 


Leu 


Val Leu Glu 


Gin 








95 






100 






105 


Phe 


Leu 


Thr Val 


Leu 


Pro 


Gly Glu Leu 


Gin 


Ser 


Trp Val Arg 


Gly 








110 






115 






120 


Gin 


Arg 


Pro Glu 


Ser 


Gly 


Glu Glu Ala 


Val 


Thr 


Leu Val Glu 


Gly 








125 






130 






135 


Leu 


Gin 


Lys Gin 


Pro 


Arg 


Arg Pro Arg 


Arg 


Trp 


Val Thr Val 


His 








140 






145 






150 


Val 


His 


Gly Gin 


Glu 


Val 


Leu Ser Glu 


Glu 


Thr 


Val His Leu 


Gly 






155 






160 






165 


Ala 


Glu 


Pro Glu 


Ser 


Pro 


Asn Glu Leu 


Gin 


Asp 


Pro Val Gin 


Ser 








170 






175 




180 


Ser 


Thr 


Pro Glu 


Gin 


Ser 


Pro Glu Glu 


Thr 


Thr 


Gin Ser Pro 


Asp 








185 






190 






195 


Leu 


Gly 


Ala Pro 


Ala 


Glu 


Gin Arg Pro 


His 


Gin 


Glu Glu Glu 


Leu 






200 






205 






210 


Gin 


Thr 


Leu Gin 


Glu 


Ser 


Glu Val Pro 


Val 


Pro 


Glu Asp Pro 


Asp 








215 






220 






225 


Leu 


Pro 


Ala Glu 


Arg 


Ser 


Ser Gly Asp 


Ser 


Glu 


Met Val Ala 


Leu 








230 






235 






240 


Leu 


Thr 


Ala Leu 


Ser 


Gin 


Gly Leu Val 


Thr 


Phe 


Lys Asp Val 


Ala 








245 






250 






255 


Val 


Cys 


Phe Ser 


Gin 


Asp 


Gin Trp Ser 


Asp 


Leu 


Asp Pro Thr 


Gin 








260 






265 






270 


Lys 


Glu 


Phe Tyr 


Gly 


Glu 


Tyr Val Leu 


Glu 


Glu 


Arg Leu Trp 


Asn 








275 






280 






285 


Cys 


Cys 


Leu Ser 


Val 


His 


Ser Gin Ser 


Pro 


Arg 


Pro Asp Glu 


He 








290 






295 






300 


Leu 


Pro 


Gly Leu 


Asp 


Glu 


Glu Glu Pro 


Gly 


Val 


Pro Asp He 


Gin 








305 






310 






315 


Glu 


Pro 


Gin Glu 


Thr 


Gin 


Glu Pro Glu 


He 


Leu 


Ser Phe Thr 


Tyr 








320 






325 






330 


Thr 


Gly 


Asp Arg 


Ser 


Lys 


Asp Glu Glu 


Glu 


Cys 


Leu Glu Gin 


Glu 








335 






340 






345 


Asp 


Leu 


Ser Leu 


Glu 


Asp 


He His Arg 


Pro 


Val 


Leu Gly Glu 


Pro 






350 




355 






360 


Glu 


lie 


His Gin 


Thr 


Pro 


Asp Trp Glu 


He 


Val 


Phe Glu Asp 


Asn 








365 






370 






375 


Pro 


Gly 


Arg Leu 


Asn 


Glu 


Arg Arg Phe 


Gly 


Tyr 












380 






385 









<210> 308 
<2H> 368 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 331074 . 1 .orf 2 :2000MAY01 
<400> 308 



Met Cys Ser 


Arg 


Lys 


Lys 


Ala Glu Phe He 


Lys 


Gly Ser His 


Lys 


1 




5 




10 






15 


Cys Asn Val 


Cys 


Ser 


Arg 


Thr Phe Phe ser 


Glu 


Asn Gly Leu 


Arg 






20 




25 






30 


Glu His Leu 


Gin 


Thr 


His 


Arg Gly Pro Ala 


Lys 


His Tyr Met 


Cys 






35 




40 






45 


Pro He cys 


Gly 


Glu 


Arg 


Phe Pro Ser Leu 


Leu 


Thr Leu Thr 


Glu 






50 


55 






60 


His Lys Val 


Thr 


His 


Ser 


Lys Ser Leu Asp 


Thr 


Gly Thr Cys 


Arg 






65 




70 






75 


He Cys Lys 


Met 


Pro 


Leu 


Gin Ser Glu Glu 


Glu 


Phe He Glu 


His 
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80 85 90 



Cys 


Gin 


Met 


His 


Pro 


Asp 


Leu 


Arg 


Asn 


Ser 


Leu 


Thr Gly Phe 


Arg 










95 








100 






105 


Cys 


Val 


Val 


Cys . 


Met 


Gin 


Thr 


Val 


Thr 


Ser 


Thr 


Leu Glu Leu 


Lys 










110 










115 






120 


He 


His 


Gly 


Thr 


Phe 


His 


Met 


Gin 


Lys 


Leu 


Ala 


Gly Ser Ser 


Ala 










125 








130 






135 


Ala 


Ser 


Ser 


Pro 


Asn 


Gly 


Gin 


Gly 


Leu 


Gin 


Lys 


Leu Tyr Lys 


Cys 










140 










145 




150 


Ala 


Leu 


Cys 


Leu Lys 


Glu 


Phe 


Arg 


Ser 


Lys 


Gin 


Asp Leu Val 


Lys 










155 










160 






165 


Leu 


Asp 


Val 


Asn Gly 


Leu 


Pro 


Tyr 


Gly 


Leu 


Cys 


Ala Gly Cys 


Met 










170 










175 






180 


Ala 


Arg 


Arg 


Ala 


Asn 


Gly 


Gin 


Val 


Gly 


Gly 


Leu 


Ala Pro Pro 


Glu 










185 








190 






195 


Pro 


Ala 


Asp 


Arg Pro 


Cys 


Ala 


Gly 


Leu 


Arg 


Cys 


Pro Glu Cys 


Ser 










200 










205 






210 


val 


Lys 


Phe 


Glu 


Ser 


Ala 


Glu 


Asp 


Leu 


Glu 


Ser 


His Met Gin 


Val 








215 








220 






225 


Asp 


His 


Arg 


Asp 


Leu 


Thr 


Pro 


Glu 


Thr 


Ser 


Gly 


Pro Arg Lys 


Gly 










230 










235 






240 


Thr 


Gin 


Thr 


Ser 


Pro 


Val 


Pro 


Arg 


Lys 


Lys 


Thr 


Tyr Gin cys 


He 










245 










250 






255 


Lys 


Cys 


Gin 


Met 


Thr 


Phe 


GlU 


Asn 


Glu 


Arg 


Glu 


He Gin He 


His 






260 










265 






270 


Asp 


Ala 


Asn 


His 


Met 


He 


Glu 


Glu 


Gly 


He 


Asn 


His Glu Cys 


Lys 








275 








280 




285 


Leu 


Cys 


Asn 


Gin 


Met 


Phe 


Asp 


Ser 


Pro 


Ala 


Lys 


Leu Leu Cys 


His 








290 










295 






300 


Leu 


He 


Glu 


His 


Ser 


Phe 


GlU 


Gly 


Met 


Gly 


Gly 


Thr Phe Lys 


Cys 










305 








310 






315 


Pro 


Val 


cys 


Phe 


Thr 


Val 


Phe 


Val 


Gin 


Ala 


Asn 


Lys Leu Gin 


Gin 










320 










325 






330 


His 


He 


Phe 


Ala tVal 


His 


Gly 


Gin 


Glu 


Asp 


Lys 


He Tyr Asp 


Cys 










335 










340 






345 


Ser 


Gin 


Cys 


Pro 


Gin 


Lys 


Phe 


Phe 


Phe 


Gin 


Thr 


Glu Leu Gin 


Asn 








350 








355 






360 


His 


Thr 


Met 


Ser 


Gin 


His 


Ala 


Gin 













365 

<210> 309 
<211> 175 
<212> PRT 
<213> Homo .sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 1170349 .l.orf 2: 2000MAY01 
<400> 309 



val 


Thr 


Phe 


Gin 


Asp 


Val 


Ala 


He 


Asp 


Phe 


Ser 


Lys Glu Glu 


Trp 


1 








5 










10 






15 


Gly 


Phe 


Leu 


Asn 


Pro 


Ala 


Gin 


Arg 


Asp 


Leu 


Tyr 


Thr Thr Val 


Met 










20 










25 






30 


Leu 


Glu 


Asn 


Tyr 


Gin 


Asn 


Leu 


Val 


Trp 


Leu 


Gly 


Leu Ser He 


Ser 










35 










40 






45 


Lys 


Ser 


Val 


He 


Ser 


Leu 


Leu 


Glu 


Lys 


Arg 


Lys 


Leu Pro Trp 


He 










50 










55 






60 


Met 


Ala 


Lys 


Glu 


Glu 


He 


Arg 


Gly 


Pro 


Leu 


Pro 


Asp Val Pro 


Gly 










65 










70 






75 


Ala 


Glu 


He 


Lys 


Glu 


Leu 


Ser 


Ala 


Lys 


Arg 


Ala 


He Asn Glu 


Val 










80 








85 






90 


Leu 


Ser 


Gin 


Phe 


Asp 


Thr 


Val 


He 


Lys 


Cys 


Thr 


Arg Asn Val 


Cys 










* 95 










100 






105 


Lys 


Glu 


Cys 


Gly 


Asn 


Leu 


Tyr 


Cys 


His 


Asn 


Met 


'Gin Leu Thr 


Leu 










110 






115 






120 


His 


Lys 


Arg 


Asn 


His 


Thr 


Gin 


Lys 


Lys 


Cys 


Asn Gin cys Leu 


Asp 
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125 

Cys Gly Lys Tyr Phe 
140 

Arg lie His Thr Gly 
155 

Lys Thr Phe Asn Gin 
170 

<210> 310 
<211> 78 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LG: 335097 . 1 .orf 2 :2000FEB18 

<400> 310 



Thr Gin 


Glu 


Val 


Glu 


Val Ala Val 


Ser Leu Asp Cys 


Cys His Cys 


1 






5 




10 




15 


Thr Pro 


Ala 


Trp 


Val 


Thr Ala Glu 


Leu Cys 


Leu Lys 


Lys Lys Met 








20 




25 




30 


Trp Asn 


Ser 


Phe 


Leu 


Gin Met Phe 


Ser Asn 


Ser He 


Pro Ser Ser 






35 




40 




45 


Val Cys 


Arg 


Tyr 


Met 


Tyr Ala lie 


He Leu 


Gin Val 


He His Val 








50 




55 




60 


Asp Cys 


lie 


Gly 


Asn 


Tyr Arg Lys Asp Tyr 


He Gly Leu Phe Arg 








65 




70 




75 


Lys Tyr 


Phe 















130 135- 

Thr Arg Gin Ser Pro Leu He Gin His Gin 

145 150 

Glu Arg Pro Tyr Lys Cys Asn Glu Cys He 

160 165 
Arg Ala His Leu Thr . 

175 



<210> 311 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076451 , 1 .orf 1: 2000FEB18 
<400> 311 

Glu Gin Ser Ser Val Gin Gly Arg Ser Val Glu Val Leu Thr Val 
1 5 10 15 

Gin Val Gin Met Leu Arg Asn Met Ser Pro Ala Met Ser Phe Leu 

20 25 30 

Met Leu Gin Pro Cys Val Asp Gin Ser Ala Ser Gly cys Asp Trp 

35 40 45 

Ser Arg Ala Cys Arg Leu Leu Gin Ser Tyr Phe Ala Gly Val Gly 

50 55 60 

Glu 



<210> 312 

<211> 73 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 805478. 1. orf 1:2000FEB01 
<400> 312 

Gly Leu Phe Gin Cys He His Gin Val Thr Glu Val Gly Gin Lys 
15 10 15 

Val Ala Thr Val Leu Leu Phe Tyr Gly Tyr Tyr Lys Cys Thr Gly 
20 25 30 

Thr Leu Lys He Thr Cys Leu Tyr Asn Val He Leu Tyr Lys Val 
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35 

Cys Ser Pro Gly Ser 
50 

Glu Leu Pro Met ' Thr 
65 

<210> 313 
<211> 184 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LG : 101269. l.orf 1:2000MAY19 

<400> 313 



Cys 


Cys 


Ser 


Phe 


Lys 


Phe His Phe Asp 


Leu 


Ser 


Trp Glu He 


Leu 


1 








5 




10 






15 


Tip 


Pro 


He 


He 


Pro 


Trp Met Leu Lys 


Met 


Val 


Leu Thr Glu 


Asn 










20 




25 






30 


Pro 


Asn 


Gin 


Glu 


He 


Ala Thr Ser Leu 


Glu 


Phe 


Leu Leu Leu 


Gin 










35 




40 






45 


As n 


Ser 


Pro 


Gly 


Ser 


Leu Arg Ala Gin 


Gin 


Arg 


Met Ser Tyr 


Tyr 










50 




55 






60 


Gly 


Ser 


Ser 


Tyr 


His 


He He Asn Ala 


Asp 


Ala 


Lys Tyr Pro 


Gly 










65 




70 






75 


Tyr 


Pro 


Pro 


Glu 


His 


He He Ala Glu 


Lys 


Arg 


Arg Ala Arg 


Arg 










80 




85 






90 


Arg 


Leu 


Leu 


His 


Lys 


Asp Gly Ser Cys 


Asn 


Val 


Tyr Phe Lys 


His 










95 




100 






105 


lie 


Phe 


Gly 


Glu 


Trp 


Gly Ser Tyr Val 


Val 


Asp 


He Phe Thr 


Thr 










110 




115 






120 


Leu 


Val 


Asp 


Thr 


Lys 


Trp Arg His Met 


Phe 


Val 


He Phe Ser 


Leu 










125 


130 






135 


Ser 


Tyr 


He 


Leu 


Ser 


Trp Leu He Phe 


Gly 


Ser 


Val Phe Trp 


Leu 








140 


145 






150 


He 


Ala 


Phe 


His 


His 


Gly Asp Leu Leu 


Asn 


Asp 


Pro Asp He 


Thr 










155 




160 






165 


Pro 


Cys 


Val 


Asp 


Asn 


Val His Ser Phe 


Thr 


Gly 


Ala Phe Leu 


Phe 






170 




175 






180 


Ser 


Leu 


Glu 


Thr 















40 45 
Asp Gin Pro Asp Val Cys Tyr Asp Pro Ser 
55 60 
Thr Val Phe Lys He Arg Leu Arg 
70 



<210> 314 
<211> 219 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI : 33 1087 . 1 .orf 2 :2000MAY01 
<400> 314 



Leu 


Ser 


Gly 


His 


Val 


Gin 


Thr Leu Glu 


Ser 


Pro 


Pro Gin cys 


ser 


1 








5 






10 






15 


Pro 


Ala 


Pro 


Gly 


Gin 


Pro 


Asn Phe Cys 


Leu 


Leu 


Asp Gly Asp 


Gin 










20 




25 






30 


Val 


Ala 


Ala 


Ala 


Gly 


Ala 


Gly Ala Val 


Pro 


Ala 


Gly Val Glu 


Cys 










35 




40 






45 


Leu Gly 


Leu 


Leu 


Val 


Arg 


Gin Arg Gly 


Arg 


Gly 


Gin Lys Cys 


Leu 










50 






55 






60 


Pro 


Ser 


Leu 


Pro 


Gin 


Thr 


Gin Glu Ala 


Gly 


Pro 


Ala Ala Ala 


Leu 










65 






70 






75 


Arg 


Pro 


Arg 


Ser 


Thr 


Pro 


Cys Phe Val 


Tyr 


Gin 


Pro Ala He 


Arg 










80 






85 






90 


Glu 


Ala 


Asn 


Gly 


He 


Val 


Glu Cys Gly 


Pro 


Cys 


Gin Lys Val 


Phe 



95 100 105 
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Val 


Val 


Gin Gin 


He 


Pro 


Asn 


Ser Asn 


Leu 


Leu 


Leu 


Leu Val Thr 








110 








115 






120 


Asp 


Pro 


Thr Cys 


Asp 


Cys 


Ser 


He Phe 


Pro 


Pro 


Val 


Leu Gin Glu 








125 






130 






135 


Ala 


Thr 


Glu Val 


Lys 


Tyr 


Asn 


Ala Ser 


Val 


Lys 


Cys 


Asp Arg Met 








140 






145 






150 


Arg 


Ser 


Gin Lys 


Leu 


Arg 


Arg 


Arg Pro 


Asp 


Ser 


Cys 


His Ala Phe 








155 








160 






165 


His 


Pro 


Glu Val 


Arg 


Val 


Glu 


Ala Asp 


Arg 


Gly 


Trp 


Ala Gly Phe 








170 








175 






180 


Ser 


Ser 


Pro Asn 


Pro 


Leu 


Cys 


Leu Gly 


Leu 


Cys 


Pro 


Cys Arg Gin 








185 








190 






195 


Glu 


His 


lie Gly 


Met 


Pro 


Met 


Asn Thr 


Pro 


Val 


Pro 


Val Leu Leu 






200 








205 






210 


Gly 


Gly 


Asn lie 


Arg 


Val 


Tyr 


Ala Leu 











215 

<210> 315 
<211> 1603 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 410188 . 1 .orf 1 : 2000MAY01 
<400> 315 



Ala 


Ala 


Glu 


Pro 


Ala 


Pro 


Ser Ala Pro 


Ser 


Pro Ala 


Pro Ser Arg 


1 








5 






10 




15 


Val 


Arg 


Ala 


Lys 


Gin 


Gin 


Pro Pro Thr 


Pro 


Gly Pro 


Gly Arg Gly 








20 






25 




30 


Thr 


Ser 


Ser 


Phe 


Pro 


Thr 


Gly Asn Val 


Arg 


Arg Ala 


Cys Ala Gin 










35 




40 




45 


Lys 


Gin 


Asp 


Glu 


Lys 


Met 


Ala Asn Phe 


Leu 


Leu Pro 


Arg Gly Thr 






50 






55 




60 


Ser 


Ser 


Phe 


Arg 


Arg 


Phe 


Thr Arg Glu 


Ser 


Leu Ala 


Ala He Glu 








65 




70 




75 


Lys 


Arg 


Met 


Ala 


Glu 


Lys 


Gin Ala Arg 


Gly 


Ser Thr 


Thr Leu Gin 






80 




85 




90 


Glu 


Ser 


Arg 


Glu 


Gly 


Leu 


Pro Glu Glu 


Glu 


Ala Pro 


Arg Pro Gin 








95 






100 




105 


Leu 


Asp 


Leu 


Gin 


Ala 


Ser 


Lys Lys Leu 


Pro 


Asp Leu 


Tyr Gly Asn 








110 






115 




120 


Pro 


Pro 


Gin 


Glu 


Leu 


lie 


Gly Glu Pro 


Leu 


Glu Asp 


Leu Asp Pro 










125 




130 




135 


Phe 


Tyr 


Ser 


Thr 


Gin 


Lys 


Thr Phe He 


Val 


Leu Asn 


Lys Gly Lys 








140 






145 




150 


Thr 


He 


Phe 


Arg 


Phe 


Ser 


Ala Thr Asn 


Ala 


Leu Tyr 


Val Leu Ser 








155 






160 


165 


Pro 


Phe 


His 


Pro 


Val 


Arg 


Arg Ala Ala 


Val 


Lys lie 


Leu Val His 










170 






175 




180 


Ser 


Leu 


Phe 


Asn 


Met 


Leu 


He Met Cys 


Thr 


lie Leu 


Thr Asn Cys 










185 






190 




195 


Val 


Phe 


Met 


Ala 


Gin 


His 


Asp Pro Pro 


Pro 


Trp Thr 


Lys Tyr Val 










200 






205 




210 


Glu 


Tyr 


Thr 


Phe 


Thr 


Ala 


lie Tyr Thr 


Phe 


Glu Ser 


Leu Val Lys 








215 




220 




225 


He 


Leu 


Ala 


Arg 


Ala 


Phe 


.Cys Leu His 


Ala 


Phe Thr 


Phe Leu Arg 








230 




235 




240 


Asp 


Pro 


Trp 


Asn 


Trp 


Leu 


Asp Phe Ser 


Val 


He He 


Met Ala Tyr 








245 




250 




255 


Thr 


Thr 


Glu 


Phe 


Val 


Asp 


Leu Gly Asn 


Val 


Ser Ala 


Leu Arg Thr 










260 




265 




270 


Phe 


Arg 


Val 


Leu 


Arg 


Ala 


Leu Lys Thr 


He 


Ser Val 


He Ser Gly 








275 






280 




285 


Leu 


Lys 


Thr 


He 


Val 


Gly 


Ala Leu He 


Gin 


Ser Val 


Lys Lys Leu 










290 




295 




300 
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Ala Asp Val 
Leu He Gly 
Val Arg Asn 
Ala Asp Gly 
Pro Glu Asn 
Cys Gly Asn 
Cys Leu Lys 
Asp Ser Phe 
Gin Asp Cys 
Gly Lys He 
Ser Phe Tyr 
Tyr Glu Glu 
Glu Lys Arg 
Glu Ala Leu 
Leu Glu Met 
Ser Lys Arg 
Glu Asp Arg 
Met Asn His 
Lys Pro Arg 
Asp Leu Gly 
Ala Arg Glu 
Pro Leu Arg 
Ser Ala Pro 
Asp' Cys Asn 
Ala Thr Ser 
His Pro Pro 
Gin Met Leu 
Pro Gly Ala 
Ser Ala Leu 
Cys Trp Asn 
Pro Leu Trp 
Asp Pro Phe 
Thr Leu Phe 
Glu Glu Met 



Met 


Val 


Leu 




305 




Leu 


Gin 


Leu 




320 




Phe 


Thr 


Ala 




335 




Leu 


Val 


Trp 




350 




Tyr 


Leu 


Leu 




365 




Ser 


Ser 


Asp 




380 




Ala 


Gly 


Glu 




395 




Ala 


Trp 


Ala 




410 




Trp 


Glu 


Arg 




425 




Tyr 


Met 


He 




440 




Leu 


Val 


Asn 




455 




Gin 


Asn 


Gin 




47 0 




Phe 


Gin 


Glu 




485 




Thr 


He 


Arg 




500 




Ser 


Pro 


Leu 




515 




Arg 


Lys 


Arg 




530 




Leu 


Pro 


Lys 




545 




Leu 


Ser 


Leu 




560 




Ser 


Ser 


Arg 




575 




Ser 


Glu 


Ala 




590 




Ser 


Glu 


Ser 




605 




Arg 


Thr 


Ser 




620 




Gly 


His 


Ala 




635 




Gly 


Val 


Val 




650 




Pro 


Gly 


Ser 




665 




Asp 


Thr 


Thr 


680 




Thr 


Ser 


Gin 




695 




Arg 


Gin 


Arg 




710 




Glu 


Glu 


Leu 




725 




Arg 


Leu 


Ala 


740 




Met 


Ser 


He 




755 




Thr 


Asp 


Leu 




770 




Met 


Ala 


Leu 




785 




Leu 


Gin 


Val 



Thr 


Val 


Phe 


Phe 


Met 


Gly 


Leu 


Asn 


Gly 


Glu 


Ser 


Leu 


Lys 


Asn 


Gly 


Ala 


Gly 


Thr 


Asn 


Pro 


Asp 


Phe 


Leu 


Ala 


Leu 


Tyr 


Gin 


Phe 


Phe 


Met 


Leu 


He 


Leu 


Ala 


Thr 


He 


Ala 


Met 


Glu 


Gly 


Val 


Asp 


Ala 


Pro 


Val 


Met 


Ser 


Ser 


Ser 


Asp 


Ser 


Thr 


Arg 


Gly 


Gly 


Ser 


He 


Asp 


Phe 


Ala 


His 


His 


Thr 


Ala 


Gin 


Gly 


Leu 


His 


Gly 


Ser 


Leu 


Leu 


His 


Leu 


Leu 


inr 


pro 


ber 


Ala 


Pro 


Cys 


Ala 


Leu 


Ser 


VjlU 


LslU 


Ser 


Gin 


Arg 


Tyr 


Lys 


Gin 


Gly 


Thr 


He 


Thr 


Glu 


His 


Tyr 


Gly 


Asn 


Leu 



Cys 


Leu 


Ser 


310 






Asn 


Leu 


Arg 


325 






Thr 


Asn 


Gly 


340 






Asp 


Leu 


Tyr 


355 






Thr 


Ser 


Asp 


370 






Cys 


Pro 


Glu 


385 






His 


Gly 


Tyr 


400 






Leu 


Phe 


Arg 


415 






Gin 


Thr 


Leu 


430 






Leu 


Val 


He 


445 






Ala 


Val 


Val 


460 






Ala 


Glu 


Thr 


475 






Met 


Leu 


Lys 


490 






Thr 


Val 


Ser 


505 






Asn 


Ser 


His 


520 






Gly 


Thr 


Glu 


535 






GlU 


Asp 


Gly 


550 






Leu 


Ser 


Arg 


565 






Phe 


Thr 


Phe 


580 






Asp 


Asp 


Glu 


595 






Ser 


Leu 


Leu 


610 






Gin 


Pro 


Ser 


625 






Lys 


Lys 


Asn 


640 






Gly 


Ala 


Gly 


655 






Arg 


Pro 


Val 


670 






Glu 


Glu 


Pro 


685 






Val 


Asp 


Gly 


700 






Ala 


Val 


Ser 


715 






Arg 


His 


Lys 


730 






Leu 


He 


Trp 


745 




Val 


Lys 


Leu 


760 




Met 


Cys 


He 


775 






Asn 


Met 


Thr 


790 






Val 


Phe 


Thr 



Val 


Phe 


Ala 
315 


His 


Lys 


Cys 
330 


Ser 


Val 


Glu 
345 


Leu 


Ser 


Asp 
360 


Val 


Leu 


Leu 
375 


Gly 


Tyr 


Arg 
390 


Thr 


Ser 


Phe 
405 


Leu 


Met 


Thr 
420 


Arg 


Ser 


Ala 
435 


Phe 


Leu 


Gly 
450 


Ala 


Met 


Ala 
465 


Glu 


GlU 


Lys 
480 


Lys 


GlU 


His 
495 


Arg 


Ser 


Ser 
510 


Glu 


Arg 


Arg 
525 


Glu 


Cys 


Gly 
540 


Pro 


Arg 


Ala 
555 


Thr 


Ser 


Met 
570 


Arg 


Arg 


Arg 
585 


Asn 


Ser 


Thr 
600 


Val 


Pro 


Trp 
615 


Pro 


Gly 


Thr 
630 


Ser 


Thr 


Val 
645 


Asp 


Pro 


Glu 
660 


Met 


Leu 


GlU 
675 


Gly 


Gly 


Pro 
690 


Phe 


Glu 


Glu 
705 


Val 


Leu 


Thr 
720 


Cys 


Pro 


Pro 
735 


Glu 


Cys 


Cys 
750 


Val 


Val 


Met 
765 


Val 


Leu 


Asn 
780 


Ser 


Glu 


Phe 
795 


Gly 


He 


Phe 
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800 



Thr 


Ala 


Glu 


Met Thr 
815 


Phe 


Lys 


Tyr 


Phe 


Gin 


Gin Gly 
830 


Trp 


Asn 


Leu 


Ser 


Leu 


Met Glu 
845 


Leu 


Gly 


Val 


Leu 


Arg 


Ser Phe 
860 


Arg 


Leu 


Ser 


Trp 


Pro 


Thr Leu 


Asn 


Thr 






875 






Val 


Gly 


Ala 


Leu Gly 


Asn 


Leu 






890 






Phe 


He 


Phe 


Ala Leu 


Val* 


Gly 








905 




Arg 


Asn 


Asn 


Arg Lys 
920 


Asn 


He 


Arg 


Trp 


His 


Met His 
935 


Asp 


Phe 


Arg 


He 


Leu 


Cys Gly 
950 


Glu 


Trp 


Glu 


Val 


Gly 


Gin Lys 
965 


Ser 


He 


Met 


Val 


Leu 


Gly Asn 
980 


Leu 


Val 


Leu 


Leu 


Asn 


Ser Phe 
995 


Ser 


Ala 


Asp 


Gly 


Glu 


Val Asn 
1010 


Asn 


Leu 


Arg 


Gly 


Leu 


Arg Phe 
1025 


Val 


Lys 


Gly 


Leu 


Leu 


Arg His 
1040 


Arg 


Pro 


Gin 


Gly 


Gin 


Leu Pro 


Ser 


Cys 






1055 






Pro 


Pro 


Glu 


Thr Glu 
1070 


Lys 


Val 


Phe 


Glu 


Glu 


Gly Glu 
1085 


Gin 


Pro 


Glu 


Pro 


Val 


Cys Val 
1100 


Pro 


He 


Asp 


Gin 


Glu 


Glu Asp 


Glu 


Glu 






1115 






Ser 


Ser 


Lys 


Gin Gin 
1130 


Glu 


Ser 


Gly 


Pro 


Pro 


Asp Ser 
1145 


Arg 


Thr 


Ser 


Ser 


Glu 


Ala Glu 
1160 


Ala 


Ser 


Gin 


Trp 


Lys 


Ala Glu 
1175 


Pro 


Gin 


Glu 


Asp 


Ser 


Cys Ser 
1190 


Glu 


Gly 


Ala 


Glu 


Leu 


Leu Glu 
1205 


Gin 


He 


Asp 


Pro 


Glu 


Asp Cys 
1220 


Phe 


Thr 


Cys 


Cys 


Ala 


Val Asp 
1235 


Thr 


Thr 


Arg 


Leu 


Arg 


Lys Thr 
1250 


Cys 


Tyr 


Glu 


Thr 


Phe 


He He 
1265 


Phe 


Met 


Ala 


Phe 


Glu 


Asp He 
1280 


Tyr 


Leu 


Leu 


Leu 


Glu 


Tyr Ala 


Asp 


Lys 



1295 





805 






810 


He 


He Ala 


Leu Asp 


Pro 


Tyr Tyr 




820 






825 


He 


Phe Asp 


Ser He 


He 


Val He 




835 






840 


Leu 


Ser Arg 


Met Ser 


Asn 


Leu Ser 




850 






855 


Leu 


Arg Val 


Phe Lys 


Leu 


Ala Lys 




865 






870 


Leu 


He Lys 


He He 


Gly 


Asn Ser 




880 






885 


Thr 


Leu Val 


Leu Ala 


He 


He Val 




895 






900 


Lys 


Gin Leu 


Leu Gly 


Glu 


Asn Tyr 




910 






915 


Ser 


Ala Pro 


His Glu 


Asp 


Trp Pro 




925 




930 


Phe 


His Ser 


Phe Leu 


He 


Val Phe 




940 






945 


He 


Glu Asn 


Met Trp 


Ala 


Cys Met 




955 






960 


Cys 


Leu He 


Leu Phe 


Leu 


Thr Val 


970 






975 


Val 


Leu Asn 


Leu Phe 


He 


Ala Leu 




985 






990 


Asp 


Asn Leu 


Thr Ala 


Pro 


Glu Asp 




1000 






1005 


Gin 


Val Ala 


Leu Ala 


Aro 


He Gin 




1015 






1020 


Aro; 


Thr Thr 


Trp Asp 


Phe 


CVS Cvs 




1030 






1035 


Gin 


Lys Pro 


Ala Ala 


Leu 


Ala Ala 




1045 






1050 


He 


Ala Thr 


Pro Tyr 


Ser 


Pro Pro 




1060 




1065 


Pro 


Pro Thr 


Arg Lys 


Glu 


Thr Gin 




1075 




1080 


Gly 


Gin Gly 


Thr Pro 


Gly 


Asp Pro 




1090 






1095 


Ala 


Val Ala 


Glu Ser 


Asp 


Thr Asp 




1105 






1110 


Asn 


Ser Leu 


Gly Thr 


Glu 


Glu Glu 




1120 




1125 


Gin 


Pro Val 


Ser Gly 


Trp 


Pro Arg 




1135 






1140 


Trp 


Ser Gin 


Val Ser 


Ala 


Thr Ala 


1150 






1155 


Ala 


Ser Gin 


Ala Asp 


Trp 


Arg Gin 




1165 




1170 


Ala 


Pro Gly 


Cys Gly 


Glu 


Thr Pro 




1180 






1185 


Ser 


Thr Ala 


Asp Met 


Thr 


Asn Thr 




1195 






1200 


Pro 


Asp Leu 


Gly Gin 


Asp 


Val Lys 




1210 






1215 


Glu 


Gly Cys 


Val Arg 


Arg 


Cys Pro 




1225 






1230 


Gin 


Ala Pro 


Gly Lys 


Val 


Trp Trp 




1240 






1245 


His 


He Val 


Glu His 


Ser 


Trp Phe 




1255 






1260 


He 


Leu Leu 


Ser Ser 


Gly 


Ala Leu 




1270 




1275 


Glu 


Glu Arg 


Lys Thr 


He 


Lys Val 




1285 






1290 


Met 


Phe Thr 


Tyr Val 


Phe 


Val Leu 




1300 




1305 
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Glu Met 


Leu 


Leu Lys 


Trp Val 


Ala Tyr Gly 


Phe 


Lys Lys 


Tyr Phe 






1310 




1315 






1320 


Thr Asn 


Ala 


Trp Cys 


Trp Leu 


Asp Phe Leu 


He 


Val Asp 


Val Ser 






1325 


1330 






1335 


Leu Val 


Ser 


Leu Val 


Ala Asn 


Thr Leu Gly 


Phe 


Ala Glu 


Met Gly 






1340 




1345 






1350 


Pro lie 


Lys 


Ser Leu 


Arg Thr 


Leu Arg Ala 


Leu 


Arg Pro 


Leu Arg 






1355 




1360 






1365 


Ala Leu 


Ser 


Arg Phe 


Glu Gly 


Met Arg Val 


Val 


Val Asn 


Ala Leu 






1370 




1375 






1380 


Val Gly 


Ala 


lie Pro 


Ser He 


Met Asn Val 


Leu 


Leu Val 


Cys Leu 






1385 




1390 






1395 


lie Phe 


Trp 


Leu lie 


Phe Ser 


He Met Gly 


Val 


Asn Leu 


Phe Ala 






1400 




1405 






1410 


Gly Lys 


Phe 


Gly Arg 


Cys He 


Asn Tyr Thr 


Asp 


Gly Glu 


Phe Ser 






1415 




1420 






1425 


Leu Val 


Pro 


Leu Ser 


He Val 


Asn Asn Lys 


Ser 


Asp Cys 


Lys He 






1430 




1435 






1440 


Gin Asn 


Ser 


Thr Gly 


Ser Phe 


Phe Trp Val 


Asn 


Val Lys 


Val Asn 






1445 




1450 






1455 


Phe Asp 


Asn 


Val Ala 


Met Gly 


Tyr Leu Ala 


Leu 


Leu Gin 


Val Ala 




1460 


1465 






1470 


Thr Phe 


Lys 


Gly Trp 


Met Asp 


He Met Tyr 


Ala 


Ala Val 


Asp Ser 






1475 




1480 






1485 


Arg Glu 


Val 


Asn Met 


Gin Pro 


Lys Trp Glu 


Asp 


Asn Val 


Tyr Met 






1490 




1495 






1500 


Tyr Leu 


Tyr 


Phe Val 


He Phe 


He He Phe 


Gly 


Gly Phe 


Phe Thr 






1505 




1510 






1515 


Leu Asn 


Leu 


Phe Val 


Gly Val 


He He Asp 


Asn 


Phe Asn 


Gin Gin 






1520 




1525 






1530 


Lys Lys 


Lys 


Leu Gly 


Gly Gin 


Asp He Phe 


Met 


Thr Glu 


Glu Gin 






1535 




1540 






1545 


Lys Lys 


Tyr 


Tyr Asn 


Ala Met 


Lys Lys Leu 


Gly 


Ser Lys 


Lys Pro 






1550 




1555 






1560 


Gin Lys 


Pro 


lie Pro 


Arg Pro 


Leu Asn Lys 


Phe 


Gin Gly 


Phe Val 






1565 




1570 




1575 


Phe Asp 


lie 


Val Thr 


Arg Gin 


Ala Phe Asp 


He 


Thr He 


Met Val 






1580 




1585 






1590 


Leu lie 


Cys 


Leu Asn 


Met Val 


His His Asp 


Gly 


Gly Asp 






1595 




1600 









<210> 316 
<211> 200 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 1188288 .l.orf3 :2000MAY01 
<400> 316 



Gly 


Gly 


Ala 


Gly 


Val 


Arg Gly Ala Gly 


Trp 


Arg Leu Gin 


Gin Arg 


1 








5 




10 




15 


Gly 


Arg 


Gly 


Ala 


Arg 


Ala Pro Pro Ala 


Arg 


Gin Ala Arg 


Gin Gly 










20 




25 




30 


Gly 


Pro 


Arg 


Ala 


Arg 


Pro Leu Glu Gin 


Gin 


Gly Gly Val 


Arg Ala 










35 




40 




45 


Glu 


Arg 


Gly 


Arg 


Gly 


Asp His Trp Ala 


Trp 


Ala Thr Cys 


Gly Ala 










50 




55 




60 


Ser 


Pro 


Thr 


Leu 


Cys 


Tyr Lys Asn Gly 


Gly 


Gly Ala Phe 


Leu He 










65 


70 




75 


Pro 


Tyr 


Val 


Val 


Phe 


Phe He Cys Cys 


Gly 


He Pro Val 


Phe Phe 










80 


85 




90 


Leu 


Glu 


Thr 


Ala 


Leu 


Gly Gin Phe Thr 


Ser 


Glu Gly Gly 


He Thr 










95 


100 




105 


Cys 


Trp 


Arg 


Lys 


Val 


Cys Pro Leu Phe 


Glu 


Gly He Gly 


Tyr Ala 










110 




115 




120 
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Thr Gin Val lie 


Glu 


Ala 


His Leu Asn Val Tyr Tyr He He 


He 




125 




130 


135 


Leu Ala Trp Ala 


He 


Phe 


Tyr Leu Ser Asn Cys Phe Thr Thr 


Glu 




140 




145 


150 


Leu Pro Trp Ala 


Thr 


Cys 


Gly His Glu Trp Asn Thr Glu Asn 


Cys 




155 




160 


165 


Val Glu Phe Gin 


Lys 


Leu 


Asn Val Ser Asn Tyr Ser His Val 


Ser 




170 




175 


180 


Leu Gin Lys Cys 


His 


Leu 


Pro Cys His Gly Val Leu Gly Ala 


Pro 




185 




190 


195 


Gly Pro Gly His 


Leu 
200 









<210> 317 
<211> 329 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 427 9 97 . 4 .orf 3 : 2000MAY01 
<400> 317 

Arg Thr Gly Gly Gly Gly Gly Ser Gin Ser Pro Ala Pro Asp Gly 
15 10 15 

Glu Arg Thr Met His Cys Leu Gly Ala Glu Tyr Leu Val Ser Ala 
20 25 30 

Asp Gly Ala Pro Arg Gin Arg Glu Trp Arg Pro Gin He Tyr Arg 
,. 35 40 45 

Lys Cys Thr Asp Thr Ala Trp Leu Phe Leu Phe Phe Leu Phe Trp 
50 55 60 

Thr Gly Leu Val Phe He Met Gly Tyr Ser Val Val Ala Gly Ala 
65 70 75 

Ala Gly Arg Leu Leu Phe Gly Tyr Asp Ser Phe Gly Asn Met Cys 
80 85 90 

Gly Lys Lys Asn Ser Pro Val Glu Gly Ala Pro Leu Ser Gly Gin 
95 100 105 

Asp Met Thr Leu Lys Lys His Val Phe Phe Met Asn Ser Cys Asn 

110 115 120 

Leu Glu Val Lys Gly Thr Gin Leu Asn Arg Met Ala Leu Cys Val 

125 130 135 

Ser Asn Cys Pro Glu Glu Gin Leu Asp Ser Leu Glu Glu Val Gin 

140 145 150 

Phe Phe Ala Asn Thr Ser Gly Ser Phe Leu Cys Gly Tyr Ser Leu 

155 160 165 

Asn Ser Phe Asn Tyr Thr His Ser Pro Lys Ala Asp Ser Leu Cys 

170 175 180 

Pro Arg Leu Pro Val Pro Pro Ser Lys Ser Phe Pro Leu Phe Asn 

185 190 195 

Arg Cys Val Pro Gin Thr Pro Glu Cys Tyr Ser Leu Phe Ala Ser 

200 205 210 

Val Leu He Asn Asp Val Asp Thr Leu His Arg He Leu Ser Gly 

215 220 225 

He Met Ser Gly Arg Asp Thr He Leu Gly Leu Cys He Leu Ala 

230 235 240 

Leu Ala Leu Ser Leu Ala Met Met Leu Thr Val Gin He His Thr 

245 250 255 

Pro Pro Phe Trp Phe Thr Phe Ser Phe His Trp Leu Phe Trp Asp 
"260 265 270 

Cys Cys Leu Val Cys Gly Val Leu Trp Trp Leu Tyr Tyr Asp Tyr 

275 280 285 

Thr Asn Asp Leu Ser He Glu Leu Asp Thr Glu Gin Gly Lys Tyr 

290 295 300 

Glu Val Arg Ala Gly Val Cys Tyr Arg Asn Pro Gin Gly He Thr 

305 310 315 

Ala Asp Ala Ala Arg Leu Asp He Leu Phe Ser Glu Arg Glu 

320 325 
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<210> 318 
<211> 256 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 451682. l.orf 3 :2000FEB18 
<400> 318 



His 


Leu 


Ala 


Ala Ala 


Ala Met 


Ser Arg Arg 


Tyr 


Asp Ser Arg Thr 


1 








5 




10 




15 


Thr 


He 


Phe 


Ser 


Pro 


Glu Gly 


Arg Leu Tyr 


Gin 


Val Glu Tyr Ala 










20 




25 




30 


Met 


Glu 


Ala 


He Gly 


Asn Ala 


Gly Ser Ala 


Leu 


Gly He Leu Ala 










35 




40 




45 


Ala 


Asp 


Gly 


Val 


Val 


Leu Val 


Gly Glu Lys 


Lys 


Val Thr Ser Lys 










50 




55 




60 


Leu 


Leu 


Gin 


Thr 


Ser 


Arg Ser 


Ala Glu Lys 


Met 


Tyr Lys He Asp 










65 




70 




75 


Ser 


His 


Leu 


Ala 


Cys 


Ala Val 


Ala Gly He 


Met 


Ser Asp Ala Asn 










80 




85 




90 


lie 


Leu 


He 


Asn 


Thr 


Ala Arg 


Leu His Ala 


Gin 


Arg Tyr Ala Leu 










95 




100 




105 


Ser 


Tyr 


Gin 


Glu 


Pro 


He Pro 


Val Glu Gin 


Leu 


Val Gin Ser Leu 








110 




115 




120 


Cys 


Asp 


Thr 


Lys 


Gin 


Gly Tyr 


Thr Gin Phe 


Gly 


Gly Leu Arg Pro 










125 




130 




135 


Phe 


Gly 


Val 


Ser 


Phe 


Leu Phe 


Ala Gly Trp 


Asp 


Lys His His Gly 










140 




145 




150 


Phe 


Gin 


Leu 


Tyr 


Met 


Ser Asp 


Pro Ser Gly 


Asn 


Tyr Gly Gly Trp 










155 




160 




165 


Lys 


Ala 


Ala 


Ala 


Val 


Gly Ala 


Asn Ser Gin 


Ala 


Ala Gin Ser Met 








170 




175 




180 


Leu 


Lys 


Gin 


Asp Tyr 


Lys Asp 


Ala Leu Thr 


Arg 


Glu Glu Ala Val 








.185 




190 




195 


Gly 


Leu 


Ala 


Leu Lys 


Val Leu 


Ser Lys Thr 


Met 


Asp Ser Thr Ser 








200 




205 




210 


Leu 


Thr 


Ala 


Glu Lys 


Leu Glu 


Leu Ala Glu 


Val 


Phe Leu Gin Pro 










215 




220 




225 


Asp 


Thr 


Gly 


Glu 


Val 


Gin Tyr 


Gin Val Cys 


Ser 


Pro Glu Ala Leu 






230 




235 




240 


Gly 


Lys 


Leu 


Leu 


Ala 


Asn Ser 


Gly Leu Thr 


Gin 


Pro Thr Pro Glu 






245 




250 




255 



Ala 



<210> 319 

<211> 76 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1077283 . 1 .orf 2 :2000FEB18 
<400> 319 

Ala Val Ser Phe Arg Arg Leu Leu Gin Thr Trp Ser Thr Pro Pro 
15 10 15 

Cys Ser Ser Thr Ser Arg Leu Met Ala Ser Leu Trp Val Ala Val 

20 25 30 

Trp Leu Arg Lys Trp Leu Ala Asp Lys Val Pro Lys Thr Ala Glu 

35 40 45 

Asn Phe Arg Ala Leu Ser Thr Gly Glu Lys Gly Phe Gly Tyr Asn 

50 55 ~ 60 

Gly Phe Leu Leu Ser Gin Asn Tyr Ser Arg He His Val Pro Gly 

65 70 75 
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Trp 



<210> 320 
<211> 276 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: LG:481436 .5 .orf 3 :2000FEB18 



<400> 320 








Thr 


Asn 


lie 


Lys He 


Thr 


Met 








5 






Ser 


Glv 


Asp 


Glu A fin 


Leu 


Lys 








20 






Lys 


Phe 


His 


Pro Ast> 


Lys 


Asn 






35 






Phe 


Lys 


Lys 


He Gly 


Asn 


Ala 








50 






Lys 


Arg 


Lys 


Gin Tyr 


Asp 


Leu 








65 






Asn 


His 


Gin 


A^n Aqti 


Glv 










80 






Ala 


Asp 


lie 


Thr Pro 


Glu 


Asp 






95 






Glv 


Phe 


Pro 


Ser Gly 


Ser 


Val 






110 






Gly 


Tyr 


Ser 


Gin Gin 


His 


Gin 








125 






Glu 


Glu 


Glu 


Arg Gly 


Asp 


Gly 








140 






Met 


Pro 


He 


He Val 


Leu 


He 








155 






Met 


Val 


Ser 


Asn Pro 


Pro 


Tyr 








170 




Gly 


Gin 


Thr 


He Lys 


Met 


Gin 








185 






Tyr 


Val 


Asn 


Lys Asp 


Phe 


Lys 








200 






Gin 


Lys 


Val 


Glu Lys 


Ser 


Val 








215 






Arg 


Asn 


Asn 


Cys Trp 


Lys 


Glu 








230 






Tyr 


Ala 


Ala 


Lys Val 


Tyr 


Arg 






245 






Asp 


Ala 


Leu 


Ser Met 


Asp 


Asn 








260 






Ser 


Leu 


Tyr 


Lys Gly 


Gly 










275 







Lvs 


Val 


Leu 


Gly 


Val 


Thr Lys Asp 






10 






15 


Lys 


Ala 


Tvr 


Ara 


Lvs 


Leu Ala Leu 






25 






30 


His 


Ala 


Pro 


Gly 


Ala 


Thr Asp Ala 






40 




45 


Tvr 


Ala 


Val 


Leu 


Ser 


Asn Pro Glu 




55 






60 


Thr 


Gly 


Asn 


Glu 


Glu 


Gin Ala cys 




70 






75 


Phe 


Asn 


Phe 


His 


Arg 


Gly Cys Glu 






85 






90 


Leu 


Phe 


Asn 


He 


Phe 


Phe Gly Gly 






100 






105 


His 


Ser 


Phe 


Ser 


Asn 


Gly Arg Ala 






115 






120 


His 


Arg 


His 


Ser 


Gly 


His Glu Arg 






13 0 






135 


Gly 


Phe 


Ser 


Val 


Phe 


He Gin Leu 




145 






150 


Leu 


Val 


Ser 


Leu 


Leu 


Ser Gin Leu 






160 






165 


Ser 


Leu 


Tyr 


Pro 


Arg 


Ser Gly Thr 






175 






180 


Thr 


Glu 


Asn 


Leu 


Gly 


Val Val Tyr 






190 






195 


Asn 


Glu 


Tyr 


Lys 


Gly 


Met Leu Leu 






205 






210 


Glu 


Glu 


Asp 


Tyr 


Val 


Thr Asn He 






220 




225 


Arg 


Gin 


Gin 


Lys 


Thr 


Asp Met Gin 




235 






240 


Asp 


Asp 


Arg 


Leu 


Arg 


Arg Lys Ala 






250 






255 


Cys 


Lys 


Glu 


Leu 


Glu 


Arg Leu Thr 






265 






270 



<210> 321 
<211> 115 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI :793701 . 1 .orf 1:2000FEB01 
<400> 321 

Gin Ala Leu Leu Gin Ser His Pro Glu Ala Asp Trp Ser Thr His 
15 10 15 

Ser Arg Ser Met Arg Lys Leu He Val Arg Phe He Phe Leu Lys 
20 25 30 

Phe Trp Thr Tyr Thr Val Arg Ala Ser Thr Asp Leu Thr Gin Thr 
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35 


40 


45 


Gly Asp Cys Ser 


Gin 


Cys He His Gin Val 


Thr Glu Val Gly Gin 




50 


55 


60 


Gin lie Lys Thr 


He 


Phe Leu Phe Tyr Ser 


Tyr Tyr Glu Cys Val 


65 


70 


75 


Glu Thr He Lys 


Lys 


Leu Val Cys He Met 


Pro Leu Ser Thr Arg 




80 


85 


90 


Tyr Val Ala Arg 


Glu 


Met Thr Asp Leu Met 


Arg Val He Thr His 




95 


100 


105 


Leu Ser Pro Pro 


Gin 


Pro Pro Phe Leu Lys 






110 


115 





<210> 322 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 373 637 . 1 .orf 3 :2000FEB01 
<400> 322 



His 


Pro 


Ala 


Trp Trp 


Thr 


Thr Thr Arg 


Cys 


Trp 


Thr Cys Pro Gly 


1 






5 






10 




15 


Arg 


Pro 


His 


Pro Arg 


Pro 


Ser Arg Arg 


Arg 


Thr 


Ala Ser Trp Arg 








20 






25 




30 


Ser 


Ser 


Gly 


Thr Pro 


Thr 


Lys Thr Leu 


Arg 


Thr 


Arg Arg Lys Arg 








35 






40 




45 


Arg 


Gly 


Asp 


Ser Ser 


Arg 


Trp Pro Arg 


Pro 


Thr 


Arg Cys Cys Arg 








50 






55 




60 


Thr 


Pro 


Arg 


Asn Ala 


He 


Ser Met Thr 


Ala 


Met 


Ala Arg Arg Gly 






65 






70 




75 


Arg 


Arg 


Ala 


Ala Ala 


Gin 


Ala Ala Gly 


Pro 


Ser 


Arg Thr Pro Ser 






80 






85 




90 


Ser 


Thr 


Ser 


Ser Ala 


Ser 


Ala Thr Gin 


Pro 


Thr 


Ser Ser Gly Ser 








95 






100 




105 


Ser 


Ser 


Ala 


Ala Arg 


Thr 


His Ser Pro 


Leu 


Thr 


Ser Trp Glu Thr 








110 






115 




120 


Arg 


Trp 


Arg 


He Phe 


Trp 


Gly Gly Gin 


Arg 


Asn 


Cys Trp Gly Ser 








125 






130 




135 


Arg 


Ser 


Arg 


Ala Ser 


Ala 


Pro Leu Phe 


Ser 


Ala 


Phe Ser Glu Phe 




140 






145 




150 


Pro 


Ala 


Phe 


Gly Gly 


Val 


Phe Ser Ser 


Phe 


Asp 


Thr Gly Phe Arg 








155 






160 




165 


Ser 


Phe 


Gly 


Ser Leu 


Gly 


Ser Gly Gly 


Leu 


Ser 


Ser Phe Cys Met 








170 






175 




180 


Ser 


Tyr 


Gly 


Ser Asp 


Gly 


Thr Gly Ser 


Phe 


Lys 


Ser Met Ser Thr 








185 






190 




195 


Ser 


Thr 


Glu 


He Val 


Asp 


Gly Lys Lys 


He 


Thr 


Thr Lys Arg He 








200 






205 




210 


He 


Glu 


Asn 


Gly Gin 


Glu 


Arg Val Glu 


Val 


Glu 


Glu Asp Gly Glu 








215 






220 




225 



Leu Ser 



<210> 323 
<211> 100 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 239368 . 2 .orf 3 :2000MAY19 
<400> 323 

Glu Glu Ser Val Leu Arg Gly Lys Phe Leu Phe Thr Ser Gly He 
1 5 10 15 



170/228 



WO 01/62927 PCT/US01/06059 



Pro Arg Ala Ser 


Trp 


Val 


Asp 


Ser Gly Leu 


His 


Thr Gin Pro Gly 




20 






25 




30 


Ser Pro Gly Ser 


Ala 


Ser 


Val 


Pro Pro Leu 


Ser 


Gly Pro Gly Cys 




35 






40 




45 


Gly Leu Gly Ala 


Arg 


Pro 


Ser 


Leu Ala Pro 


Gly 


Asn Ser Asp Val 




50 






55 




60 


Phe Leu His Leu 


Leu 


Pro 


Leu 


Leu Arg Gly 


Pro 


Lys Pro Gly Lys 




65 






70 




75 


Pro Ala Asp Ala 


Ser 


His 


Pro 


Glu Asn Cys 


Glu 


Gin Thr His Arg 




80 






85 




90 


Ala Ser Pro Thr 


Pro 


Glu 


Ser 


Ser Cys Cys 








95 






100 







<210> 324 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:053826.1.orf3:2000MAY01 
<400> 324 



Asp 


Phe 


Trp 


Ala Lvs 


He Tyr Leu Tyr 


Ala 


Leu 


Glu Glv Arg Lys 


1 








5 


10 




15 


Tyr 


Arg 


Ser 


He 


Leu 


Gin Leu Val Lys 


Pro 


Trp 


Tyr Asp Glu Val 










20 




25 




30 


Lys 


Asp 


Tyr 


Ala 


Phe 


Pro Tyr Pro Gin 


Asp 


Cys 


Asn Pro Arg Cys 










35 




40 




45 


Pro 


Met 


Arg 


Cys 


Phe 


Gly Pro Met Cys 


Thr 


His 


Tyr Thr Gin Met 










50 


55 




60 


Val 


Trp 


Ala 


Thr 


Ser 


Asn Arg He Gly 


Cys 


Ala 


He His Thr Cys 










65 




70 




75 


Gin 


Asn 


Met 


Asn 


Val 


Trp Gly Ser Val 


Trp 


Arg 


Arg Ala Val Tyr 










80 




85 




90 


Leu 


Val 


Cys 


Asn 


Tyr 


Ala Pro Lys Gly 


Asn 


Trp 


He Gly Glu Ala 










95 




100 




105 


Pro 


Tyr 


Lys 


Val 


Gly 


Val Pro Cys Ser 


Ser 


Cys 


Pro Pro Ser Tyr 










110 




115 




120 


Gly 


Gly 


Ser 


Cys 


Thr 


Asp Asn Leu Cys 


Phe 


Pro 


Gly Val Thr Ser 










125 




130 




135 


Asn 


Tyr 


Leu 


T*yr Trp 


Phe Lys 









140 

<210> 325 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI :449393 . 1 .orf3 :2000MAY01 
<400> 325 



Ala Met 


Ser 


Leu 


Arg 


Val 


Leu 


Asn Pro 


Asn 


Ala Glu Val Leu Asn 


1 






5 








10 


15 


Lys Ser 


Ala 


Ala 


Leu 


His 


Met 


Asn He 


Asn 


Ala Ala Lys Gly Leu 








20 








25 


30 


Gin Asp 


Val 


Leu 


Lys 


Thr 


Asn 


Leu Gly 


Pro 


Lys Gly Thr He Lys 








35 








40 


45 


Met Leu 


Val 


Gly 


Gly 


Ala 


Gly 


Asp Leu 


Lys 


Leu Thr Lys Asp Gly 








50 








55 


60 


Asn Thr 


Leu 


Leu 


Lys 


Glu 


Met 


Gin He 


Gin 


Asn Pro Thr Ala He 








65 








70 


75 


Met He 


Ala 


Arg 


Thr 


Ala 


Val 


Ala Gin 


Asp 


Asp Thr Ser Gly Asp 








80 








85 


90 


Gly Thr 


Thr 


Ser 


Thr 


Val 


Leu 


Phe He 


Gly 


Glu Leu Met Lys Gin 
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WO 01/62927 
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95 100 105 



Ser 


Glu 


Arg Cys 


He 


Asp 


Glu 


Gly Thr 


His Pro Arg Phe Leu Val 








110 








115 120 


Asp 


Gly 


Phe Asp 


Val 


Ala 


Lys 


Arg Ala 


Cys Leu Asp Phe Leu Asp 








125 








130 135 


Lys 


Phe 


Lys Thr 


Pro 


Val 


Val 


Thr Gly 


Glu Glu Pro Asp Arg Asp 






140 








145 150 


Thr 


Leu 


Lys Met 


Val 


Ala 


Arg 


Thr Thr 


Leu Arg Thr Lys Leu Tyr 








155 








160 165 


Glu 


Gly 


Leu Ala 


Asp 


Gin 


Leu 


Thr Asp 


He Val Val Asn Ala Val 








170 








175 180 


Leu 


Cys 


He Arg 


Lys 


Pro 


Asp 


Glu Pro 


He Asp Leu Phe Met Val 






185 






190 195 


Glu 


He 


Met His 


Met 


Arg 


His 


Lys Phe 


Asp Val Asp Thr Arg Leu 








200 








205 210 


Val 


Glu 


Gly Leu 


Val 


Leu 


Asp 


His Gly 


Ser Arg His Pro Asp Met 






215 








220 225 


Lys 


Arg 


Arg Ala 


Glu 


Asn 


Cys 


Tyr He 


Leu Thr Cys Asn Val Ser 








230 








235 240 


Leu 


Glu 


Tyr Glu 


Lys 


Ser 


Glu 


He Asn 


Ala Gly Phe Phe Tyr Ser 






245 








250 255 


Asn 


Ala 


Glu Gin 


Lys 


Lys 


Lys 


Lys 





260 

<210> 326 
<211> 357 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI:1071427 . 96 .orf 2 :2000MAY01 
<400> 326 



Asp 


Glu 


Thr 


Arg 


Glu 


Asp 


Ala 


Met 


Ala 


Met 


Val 


Asp 


His Cys Leu 


1 








5 










10 






15 


Lys 


Lys 


Ala 


Leu 


Trp 

20 


Phe 


Gin 


Gly 


Arg 


Cys 
25 


Val 


Lys 


Val Asp Leu 
30 


Ser 


Glu 


Lys 


Tyr 


Lys 
35 


Lys 


Leu 


Val 


Leu 


Arg 
40 


He 


Pro 


Asn Arg Gly 
45 


He 


Asp 


Leu 


Leu 


Lys 
50 


Lys 


Asp 


Lys 


Ser 


Arg 
55 


Lys 


Arg 


Ser Tyr Ser 
60 


Pro 


Asp 


Gly 


Lys 


Glu 
65 


Ser 


Pro 


Ser 


Asp 


Lys 

70 


Lys 


Ser 


Lys Thr Asp 
75 


Gly 


Ser 


Gin 


Lys 


Thr 


Glu 


Ser 


Ser 


Thr 


Glu 


Gly 


Lys 


Glu Gin Glu 






80 










85 






90 


Glu 


Lys 


Ser 


Gly 


Glu 
95 


Asp 


Gly 


Glu 


Lys 


Asp 
100 


Thr 


Lys 


Asp Asp Gin 
105 


Thr 


Glu 


Gin 


Glu 


Pro 
110 


Asn 


Met 


Leu 


Leu 


Glu 
115 


Ser 


Glu 


Asp Glu Leu 
120 


Leu 


Val 


Asp 


Glu 


Glu 


Glu 


Ala 


Ala 


Ala 


Leu 


Leu 


Glu 


Ser Gly Ser 








125 










130 






135 


Ser 


Val 


Gly 


Asp 


Glu 
140 


Thr 


Asp 


Leu 


Ala 


Asn 
145 


Leu 


Gly 


Asp Val Ala 
150 


Ser 


Asp 


Gly 


Lys 


Lys 
155 


Glu 


Pro 


Ser 


Asp 


Lys 
160 


Ala 


Val 


Lys Lys Asp 
165 


Gly 


Ser 


Ala 


Ser 


Ala 
170 


Ala 


Ala 


Lys 


Lys 


Lys 
175 


Leu 


Lys 


Lys Arg Arg 
180 


Phe 


Pro 


Gly 


Ser 


Met 
185 


Glu 


Gly 


Phe 


Val 


Thr 
190 


Leu 


Asp 


Glu Val Gly 
195 


Asp 


Glu 


Glu 


Asp 


Ser 


Glu 


Leu 


Gin 


Lys 


Leu 


Arg 


Lys 


Ser Gly Met 










200 








205 






210 


Ala 


Phe 


Lys 


Ser 


Gly 
215 


Asp 


Lys 


Asn 


Asp 


Asp 
220 


Gly 


Leu 


Val Glu He 
225 


Lys 


Val 


Asp 


Lys 


He 


Glu 


Glu 


Leu 


Asp 


Gin 


GlU 


Asn 


Glu Ala Ala 








230 








235 






240 


Leu 


Glu 


Asn 


Gly 


He 


Lys 


Asn 


Glu 


Glu 


Asn 


Thr 


Glu 


Pro Gly Ala 



172/228 



WO 01/62927 
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245 








250 








255 


Glu Ser 


Ser Glu 


Asn 


Ala 


Asp 


Asp Pro 


Asn 


Lys 


Asp Thr 


Ser 


Glu 






260 








265 








270 


Asn Ala 


Asp Gly 


Gin 


Ser 


Asp 


Glu Asn 


Lys 


Asp 


Asp Tyr Thr 


He 






275 








280 








285 


Pro Asp 


Glu Tyr 


Arg 


lie 


Gly 


Pro Tyr 


Gin 


Pro 


Asn Val 


Pro 


Val 






290 








295 








300 


Gly lie 


Asp Tyr 


Val 


lie 


Pro 


Lys Thr 


Gly 


Phe 


Tyr Cys 


Lys 


Leu 






305 








310 








315 


Cys Ser 


Leu Phe 


Tyr 


Thr 


Asn 


Glu Glu 


Val 


Ala 


Lys Asn 


Thr 


His 






320 








325 








330 


Cys Ser 


Ser Leu 


Pro 


His 


Tyr 


Gin Lys 


Leu 


Lys 


Lys Phe 


Leu 


Asn 






335 








340 








345 


Lys Leu 


Ala Glu 


Glu 


Arg 


Arg 


Gin Lys 


Lys 


Glu 


Thr 







350 355 

<210> 327 
<211> 100 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI:336338.8.orf2:2000MAY01 
<220> 

<221> unsure 
<222> 10, 18, 30 
<223> unknown or other 

<400> 327 



Met He 


Ser 


Ser Asn 


Ser 


Pro 


Asn Leu 


Xaa 


Leu Trp Pro He 


Thr 


1 




5 








10 




15 


Thr Phe 


Xaa 


His Val 


Cys 


Thr 


Ser Cys 


Cys 


Ser Arg Leu Gin 


Xaa 






20 




25 




30 


Pro Phe 


Ser 


Leu Ala 


Asp 


Phe 


Trp Lys 


Ser 


Asn Gly Arg Val 


Leu 






35 








40 




45 


Gly Gly 


Arg 


Arg Leu 


Leu 


Tyr 


Ala cys 


Glu 


Lys Glu Gin Ser 


Val 






50 








55 




60 


Pro Thr 


Glu 


Gly Ser 


Ser 


Thr 


Thr Leu 


Leu 


Gin Asn Met Tyr 


He 






65 








70 




75 


Ser Arg 


Leu 


Ser Ser 


His 


Leu 


Arg Phe 


Leu 


Cys Ser Cys Arg 


Leu 






80 








85 




90 


He Asp 


Tyr 


Ser He 


Leu 


Leu 


Lys Arg 


Lys 










95 








100 







<210> 328 
<211> 303 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LG : 345527. l.orf 2: 2000FEB18 
<400> 328 



Arg Glu 


Leu 


Lys 


Arg 


Phe Asn Ala Asp Asn 


Lys 


Leu Leu Leu 


Thr 


1 




5 


10 






15 


Gly Thr 


Pro 


Leu 


Gin 


Asn Asn Leu Ser Glu 


Leu 


Trp Ser Leu 


Leu 








20 


25 




30 


Asn Phe 


Leu 


Leu 


Pro 


Asp Val Phe Asp Asp 


Leu 


Lys Ser Phe 


Glu 








35 


40 






45 


Ser Trp 


Phe 


Asp 


He 


Thr Ser Leu Ser Glu 


Thr 


Ala Glu Asp 


He 








50 


55 






60 


He Ala 


Lys 


Glu 


Arg 


Glu Gin Asn Val Leu 


His 


Met Leu His 


Gin 








65 


70 






75 


He Leu 


Thr 


Pro 


Phe 


Leu Leu Arg Arg Leu 


Lys 


Ser Asp Val 


Ala 
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Leu 


Glu 


Val 


Pro 


Pro 


Lys 


Arg 










95 




Ser 


Lys 


Lys 


Gin 


Glu 


He 


Phe 








110 






lie 


Ala 


Asn 


Met 


.Phe 


Gly 


Ser 










125 




Ser 


Pro 


Thr 


Gly 


Arg 
140 


Pro 


Lys 


Tyr 


Ser 


Lys 


He 


Asp 
155 


Asp 


Phe 


Ser 


Gin 


He 


Gin 


Pro 
170 


Glu 


Val 


Val 


Asn 


He 


Pro 


Val 
185 


Glu 


Ser 


He 


Met 


Met 


Leu 


Leu 


Arg 


Lys 










200 




Glu 


Tyr 


Pro 


He 


Asp 


Pro 


Val 








215 






Glu 


Leu 


Val 


Thr 


Asn 


Ser 


Gly 










23 0 




Leu 


Pro 


Glu 


Leu 


Lys 
245 


Lys 


Arg 


Gin 


Met 


Thr 


Ser 


Met 
260 


Leu 


Asp 


Arg 


Asp 


Phe 


Asn 


Phe 
275 


Ser 


Arg 


Arg 


Glu 


Lys 


Lys 


Thr 
290 


Cys 


Thr 


Leu 


Ser 


Ser 















85 






90 


Glu 


Val 


Val 


Val 


Tyr Ala Pro 


Leu 






100 






105 


Tyr 


Thr 


Ala 


He 


Val Asn Arg 


Thr 




115 




120 


Ser 


Glu 


Lys 


Glu 


Thr He Glu 


Leu 






130 






135 


Arg 


Arg 


Thr 


Arg 


Lys Ser He 


Asn 






145 






150 


Pro 


Asn 


Glu 


Leu 


Glu Lys Leu 


He 






160 






165 


Asp 


Arg 


Glu 


Arg 


Ala Val Val 


Glu 






175 






180 


Glu 


Val 


Asn 


Leu 


Lys Leu Gin 


Asn 






190 




195 


Cys 


Cys 


Asn 


His 


Pro Tyr Leu 


He 




205 






210 


Thr 


Gin 


Glu 


Phe 


Lys He Asp 


Glu 






220 




225 


Lys 


Phe 


Leu 


He 


Leu Asp Arg 


Met 






235 






240 


Gly 


His 


Lys 


Val 


Leu Leu Phe 


Ser 




250 






255 


He 


Leu 


Met 


Asp 


Tyr Cys His 


Leu 






265 


270 


Leu 


Met 


Gly 


Pro 


Cys Leu Thr 


Gin 






280 






285 


Ala 


Ser 


Thr 


Arg 


He Gin Arg 


Cys 






295 






300 



<210> 329 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:1089383 .l.orf2 :2000FEB18 
<400> 329 



Thr Ala Leu Leu 


Leu 


Thr Gin Ser Leu Phe 


Gly Ser Leu Phe 


Thr 


1 


5 


10 




15 


Trp Thr Arg Val 


Thr 


Phe Gly Ala Glu Asp 


Pro Gly Gin Glu 


Asp 




20 


25 




30 


Ser Phe Arg Arg 


Arg 


Val Pro Cys Pro Cys 


Pro His Ser Val 


Arg 




35 


40 




45 


Arg Ser Thr Tyr 


Asp 


Leu Arg Ser Ser Asp 


Gin Pro Ala Gin 


Gly 


50 


55 




60 


Thr Ser His Glu 


Phe 


Gin He Gly Phe Pro 


Thr He 






65 


70 







<210> 330 

<211> 76 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1092522 . 1 . orf 2 :2000FEB18 
<400> 330 

Phe Ser Tyr Leu Ser Ser Lys Trp Val Val Lys Gin Gin Arg Gin 
1 5 10 .15 

Leu Ala He Ser Thr Met His Leu Ala Gin Glu Leu Leu Met Asn 
20 25 30 
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Val Gin Cys Ser Gly Gly Ser Arg His Phe Ser Lys Glu Met Arg 
35 40 45 

Thr Leu Lys Met Arg Ser lie Val Ala Lys Pro Leu Glu Val Asp 
50 55 60 

Asn Asp Gin Leu Arg Ala lie Ser Lys Ala Asp Pro Leu Lys Ala 
65 70 75 

Thr 



<210> 331 

<211> 74 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1093216 . 1 . orf 2 : 2000FEB18 
<220> 

<221> unsure 
<222> 2, 17, 36' 
<223> unknown or other 



<400> 331 

Gly Xaa Pro Pro Thr Thr Ser Gly Pro Gin Thr Asn Gin Pro Lys 
15 10 15 

Glu Xaa Leu Met Asn Phe Lys Ser Asp Ser Gin Leu Tyr Glu Asp 

20 25 30 

Thr Leu Ala Gly Arg Xaa Val Leu lie Lys Asn Leu Thr Pro Gin 

35 40 45 

Thr Leu Gin Pro Arg Trp Thr Gly Pro Tyr Leu Val lie Tyr Ser 

50 55 60 

Thr Pro Thr Ala Val Arg Leu Gin Asp Pro Pro His Trp Val 

65 70 



<210> 332 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 270318 .3 .orf 3 : 2000FEB01 



<400> 332 

Leu Ser Phe Lys Arg Asp Ser Trp Glu Tyr Gly His Pro Ala Pro 
15 10 15 

Arg Cys Gly Asn Glu Ser Ser Arg Ser Gly Glu Ala Ala Leu Ala 

20 25 30 

Asp Val Gin Leu Ala Ala Pro Val Ser Asn Gin Leu His Pro Asp 

35 40 45 

Gly Val Glu Asp Arg Gly Val Gly Gly Leu Leu Arg Ser Tyr Thr 

50 55 60 

Thr Gin Leu Thr Met Asn lie 

65 

<210> 333 
<211> 192 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:335671.2 .orf 2 :2000FEB01 
<400> 333 

Trp Val His Val Leu Leu Arg Glu Arg Lys Lys His Ala Gin Leu 



175/228 
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1 






5 










10 




15 


Gill 


His 


Gly Ser 


Arg 


Gly 


Val 


Tyr 


Leu 


Leu 


Val 


Ser Thr Arg Ala 








20 










25 




30 


Gly 


Gly 


Leu Gly 


He 


Asn 


Leu 


Thr 


Ala 


Ala 


Asp 


Thr Val He He 








35 










40 




45 


Tyr 


Asp 


Ser Asp 


Trp 


Asn 


Pro 


Gin 


Ser 


Asp 


Leu 


Gin Ala Gin Asp 








50 










55 




60 


Arg 


Cys 


His Arg 


lie 


Gly 


Gin 


Thr 


Lys 


Pro 


Val 


Val Val Tyr Arg 








65 










70 




75 


Leu 


Val 


Thr Ala 


Asn 


Thr 


He 


Asp 


Gin 


Lys 


He 


Val Glu Arg Ala 








80 








85 




90 


Ala 


Ala 


Lys Arg 


Lys 


Leu 


Glu 


Lys 


Leu 


He 


He 


His Lys Asn His 






95 








100 




105 


Phe 


Lys 


Gly Gly 


Gin 


Ser 


Gly 


Leu 


Asn 


Leu 


Ser 


Lys Asn Phe Leu 








110 










115 




120 


Asp 


Pro 


Lys Glu 


Leu 


Met 


Glu 


Leu 


Leu 


Lys 


Ser 


Arg Asp Tyr Glu 








125 










130 




135 


Arg 


Glu 


He Lys 


Gly 


Ser 


Arg 


Glu 


Lys 


Val 


He 


Ser Asp Lys Asp 








140 










145 




150 


Leu 


Glu 


Leu Leu 


Leu 


Asp 


Arg 


Ser 


Asp 


Leu 


lie 


Asp Gin Met Asn 








155 






160 




165 


Ala 


Ser 


Gly Pro 


He 


Lys 


Glu 


Lys 


Met 


Gly 


He 


Phe Lys He Leu 








17 0 










175 




180 


Glu 


Asn 


Ser Glu 


Asp 


Ser 


Ser 


Pro 


Glu 


Cys 


Leu 


Phe 








185 










190 







<210> 334 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI :793758 . 1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 36 

<223> unknown or other 
<400> 334 

Gly Asp Pro Pro Thr Thr Ser Gly Pro Gin Thr Asn Gin Pro Lys 
15 10 15 

Glu His Leu Met Asn Phe Lys Ser Asp Ser Gin Leu Tyr Glu Asp 

20 25 30 

Thr Leu Ala Gly Arg Xaa Val Leu He Lys Asn Leu Thr Pro Gin 

35 40 45 

Thr Leu Gin Pro Arg Trp Thr Gly Pro Tyr Leu Val He Tyr Ser 

50 55 60 

Thr Pro Thr Ala Val Arg Leu Gin Asp Pro Pro His Trp Val 

65 70 

<210> 335 

<211> 72 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 803718 .1. orf 2:2000FEB01 
<220> 

<221> unsure 
<222> 41 

<223> unknown or other 



<400> 335 
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Thr Ala Leu Leu Leu Thr Gin Ser Leu Phe Gly Ser Leu Phe Thr 
1 5 10 15 

Trp Thr Arg Val Thr Phe Gly Ala Glu Asp Pro Gly Gin Glu Asp 

20 25 30 

Ser Phe Arg Arg Arg Val Pro cys Pro cys Xaa His Ser Val Arg 

35 40 45 

Arg Ser Thr Tyr Asp Leu Arg Ser Ser Asp Gin Pro Ala Gin Gly 

50 55 60 

Thr Ser His Glu Phe Gin lie Gly Phe Pro Thr lie 

65 70 

<210> 336 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :412179 . 1 .orf 2 : 2000FEB01 
<400> 336 

Thr He Glu Met Met Leu Asp He Lys Gin He Gin Val He Phe 
1*5 10 15 

Leu Phe Glu Phe Lys Met Gly Arg Lys He Ala Glu Thr Thr Aro 

20 25 30 

Asn He Asp Asn Ala Phe Gly Pro Gly Leu Leu Thr Asn Val Gin 

35 40 45 

Cys Ser Gly Ser Ser Arg Arg Gin Gly Ala 

50 55 



<210> 337 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 815679 . 1 .orf 3 :2000FEB01 
<400> 337 



Leu 


Arg 


Tyr He 


Asn 


Gly 


Ser 


Met 


Ser 


Ser 


Leu 


Tyr Pro Arg Leu 


1 






5 










10 




15 


Cys 


His 


Leu Ser 


Leu 


Gin 


Phe 


Leu 


Pro 


Leu 


Lys 


Asn Arg Ser He 








20 










25 




30 


Phe 


Leu 


Gin Ser 


Leu 


Met 


Leu 


Gly Phe 


Glu 


Leu 


Cys Leu Ala Leu 








35 










40 




45 


Ala 


Thr 


Gly He 


Leu 


He 


cys 


Met 


Thr 


Lys 


Asn 


Leu Glu Ser Val 








50 










55 




60 


Asn 


Ser 


Phe Val 


Leu 


Ala 


His 


Ser 


Cys 


Tyr 


His 


His Glu Asn Lys 








65 










70 




75 


Pro 


Arg 


Pro Gly 


Cys 


Cys 


Phe 


Gin 


Gin 


Lys 


He 


Arg Asp Thr Lys 








80 










85 




90 


Asn 


Lys 


Val Glu 


Leu 


Pro 


Arg 


His 


Ala 


His 


Ala 


Arg Leu Thr Asn 



95 100 105 

Pro Gin 



<210> 338 
<211> 147 
<212> PRT 
<213> Homo sapiens 

<220> 

<221^ misc feature 

<223> Incyte ID No: LI :481361 .3 .orf 3 :2000FEB01 
<400> 338 



177/228 



WO 01/62927 



PCT/US01/06059 



Cys 


Pro 


Leu Gin 


Glu 


Met 


Lys 


Pro 


1 






5 








Leu 


Pro 


Lys Leu 


Lys 
20 


Ser 


Met 


Ala 


Glu 


Lys 


Gly Met 


Lys 
35 


Ala 


Phe 


Val 


Lys 


His 


Glu Cys 


Asn 
50 


Leu 


He 


Phe 


Ala 


Ser 


Leu Ala 


Arg 
65 


Gly 


Phe 


Ala 


Pro 


GlU 


Leu Arg 


Gly 
80 


Lys 


Gin 


Phe 


Val 


Asn 


Thr Asp 


Thr 
95 


lie 


Pro 


Phe 


Gin 


Arg 


Gin Lys 


Leu 
110 


Leu 


Glu 


Gin 


Asn 


Glu 


Gly Arg 


Arg 
125 


Lys 


Phe 


He 


Gin 


Lys 


Ala Lys 


Lys 
140 


Glu 


Lys 


Lys 



<210> 339 
<211> 257 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:247388.1 

<400> 339 



Gin 


Asn 


Ser 


Val 


Lys 


Leu 


Ala 


He 


1 








5 








Leu 


Gin 


Ala 


Met 


Val 


Arg 


Ser 


Ala 










20 






Lys 


Leu 


Leu 


Pro 


Lys 
35 


Leu 


Lys 


Ala 


Phe 


Ser 


Gin 


Met 


Val 
50 


Arg 


Cys 


Leu 


He 


Gin 


Arg 


Arg 


Tyr 
65 


Leu 


Tyr 


Glu 


Gly 


Asn 


Leu 


Arg 


Gin 
80 


Ala 


Ala 


He 


Ser 


Asp 


Arg 


Phe 


Val 


Phe 


Leu 


Leu 








95 








Gly 


He 


Asn 


Leu 


Thr 
110 


Ala 


Ala 


Asp 


Asp 


Trp 


Asn 


Pro 


Gin 
125 


Asn 


Asp 


Leu 


Arg 


He 


Gly 


Gin 


Ser 
140 


Lys 


Ala 


Val 


Arg 


Asn 


Ser 


Tyr 


Glu 

155 


Arg 


Glu 


Met 


Leu 


Gly 


Leu 


Asp 


Lys 


Ala 


Val 


Leu 








170 








Gly 


Asn 


He 


Thr 


Gly 
185 


He 


Gin 


Gin 


Asp 


Leu 


Leu 


Arg 


Lys 
200 


Gly 


Ala 


Tyr 


Asp 


Glu 


Gly 


Ser 


Lys 
215 


Phe 


Cys 


Glu 


Leu 


Arg 


Arg 


Thr 


Thr 


Thr 


He 


Thr 








230 








Ser 


Thr 


Phe 


Ala 


Lys 


Ala 


Ser 


Phe 



245 
Asp He 



Gin 


Arg 


Asn 


Thr 


Ala Asp Leu 




10 






15 


Leu 


Ala 


Asp 


Arg 


Ala Val Phe 




25 






30 


ser 


Tyr 


Val 


Gin 


Ala Tyr Ala 




40 






45 


Arg 


Leu 


Lys 


Asp 


Leu Asp Phe 




55 






60 


Leu 


Leu 


Arg 


Met 


Pro Lys Met 




70 






75 


Pro 


Asp 


Phe 


Val 


Pro Val Asp 




85 






90 


Lys 


Asp 


Lys 


He 


Arg Glu Lys 




100 






105 


Gin 


Arg 


Arg 


Glu 


Lys Thr Glu 




115 






120 


Lys 


Asn 


Lys 


Ala 


Trp Ser Lys 




130 






135 


Lys 


Lys 


Met 


Asn 





145 



orfl:2000MAY19 



Leu 


Tyr 


Leu Met 


Thr Phe His 




10 




15 


Gly 


Lys 


Leu Val 


Leu He Asp 




25 




30 


Gly 


Gly 


His Lys 


Val Leu lie 




40 




45 


Asp 


He 


Leu Glu 


Asp Tyr Leu 




55 




60 


Arg 


He 


Asp Gly 


Arg Val Arg 




70 




75 


Asp 


Arg 


Phe Ser 


Lys Pro Asp 




85 




90 


Cys 


Thr 


Arg Ala 


Gly Gly Leu 




100 




105 


Thr 


Cys 


He He 


Phe Asp Ser 




115 




120 


Gin 


Ala 


Gin Ala 


Arg Cys His 




130 




135 


Lys 


Val 


Tyr Arg 


Leu lie Thr 




145 




150 


Phe 


Asp 


Lys Ala 


Ser Leu Lys 




160 




165 


Gin 


Ser 


Met Ser 


Gly Arg Asp 




175 




180 


Phe 


Ser 


Lys Lys 


Glu He Glu 




190 




195 


Ala 


Ala 


He Met 


Glu Glu Asp 




205 




210 


Glu 


Asp 


He Asp 


Gin lie Leu 




220 




225 


He 


Glu 


Ser Glu 


Gly Lys Gly 




235 




240 


Val 


Ala 


Ser Glu 


Asn Arg Thr 




250 




255 



178/228 



WO 01/62927 



PCT/US01/06059 



LG :255789 . 10 .orf 3 : 2000MAY19 



He Leu Phe Ser His Phe Pro He Gin Met 

10 15 
Leu Val Pro Leu Leu Pro Ser Lys Thr He 

25 30 
Arg Gly Gin Val Phe Leu Met Leu Leu Arg 

40 45 
Ala His Asp Leu Leu Val Lys Gly Ala Gly 

55 60 



LI:787618.1.orfl:2000MAY01 



<210> 340 
<211> 63 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 340 

Leu Lys Glu Leu Thr 
1 5 
Lys Thr Ala Ser Phe 
20 

Leu Phe Asp Arg Ala 
35 

Lys Pro Ser He Thr 
50 

Lys Tyr Lys 



<210> 341 
<211> 112 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 

<400> 341 



Gly 


Thr 


Leu 


Met 


Asp 


1 








5 


He 


Cys 


Leu 


Cys 


Ser 
20 


Ser 


Pro 


Ala 


Pro 


Asn 
35 


His 


Trp 


Asn 


He 


Asn 








50 


Phe 


He 


Cys 


Ala 


His 








65 


Gin 


Ala 


Arg 


Leu 


Leu 








80 


Gin 


Ser 


Gin 


Ala 


Ser 
95 


Ser 


Ala 


Arg 


Pro 


He 
110 



<210> 342 
<211> 427 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 342 

Thr Arg Thr Thr Thr 
1 5 
Phe Arg Cys Thr Arg 
20 

Cys Ser Pro Arg He 
35 

Tyr Ser He Leu Pro 
50 

Tyr Val Ser Val Asn 



Pro 


Phe 


Pro Pro 


Cys 








10 


Ser 


Ser 


Pro Leu 


Lys 








25 


Val 


Ala 


Phe Ser 


His 








40 


Leu 


His 


Asp- Gin 


Lys 






55 


Ser 


Leu 


Leu Leu 


Ser 








70 


Leu 


Pro 


Ala Glu 


Asp 








85 


Gin 


Thr 


Arg Ser 


Cys 








100 


Ala 


Pro 







He 


Gin Asp Ser Ala 




15 


Asn 


Arg Glu Tyr He 




30 


Met 


Ser Ser Phe Gly 




45 


Leu 


Gly Lys Cys Ala 




60 


Pro 


Lys Glu Gin Ala 




75 


Lys 


Arg His Gin Glu 




90 


Gin 


He Asn Gin Ser 




105 



LI:331610.2.orf3:2000MAY01 



Arg Leu Gly Ser Pro Lys Gly Ser Thr Cys 

10 15 
Thr Thr Ser Arg Glu Ser Thr Trp Ala Leu 

25 30 
Pro Thr Trp Ala Lys Asn Gly Thr Val Ser 

40 45 
Ser His He Gly Asp Val Ser He Tyr Thr 

55 60 
Pro Thr Asn Gly Ala He Tyr Ala Leu Arg 



179/228 



WO 01/62927 



PCT/US01/06059 









65 










70 






75 


Ser 


Phe 


Asn Phe 


Glu 


Gin 


Thr 


Lys 


Ala 


Phe 


Glu 


Phe Lys Val 


Leu 








80 








85 






90 


Ala 


Lys 


Asp Ser 


Gly 


Ala 


Pro 


Ala 


His 


Leu 


Glu 


Ser Asn Ala 


Thr 




95 










100 






105 


Val 


Arg 


Val Thr 


Val 


Leu 


Asp 


Val 


Asn 


Asp 


Asn 


Ala Pro Val 


He 








110 








115 






120 


Val 


Leu 


Pro Thr 


Leu 


Gin 


Asn 


Asp 


Thr 


Ala 


Glu 


Leu Gin Val 


Pro 








125 








13 0 






135 


Arg 


Asn 


Ala Gly 


Leu 


Gly 


Tyr 


Leu 


Val 


Ser 


His 


Cys Ala Arg 


Pro 








140 










145 






150 


Arg 


Gin 


Arg Leu 


Arg 


Arg 


Arg 


Ala 


Gly 


Val 


Ser 


Pro Thr Lys 


He 








155 










160 






165 


Val 


Asp 


Gly Asn 


Asp 


Asp 


His 


Leu 


Phe 


Glu 


He 


Asp Pro Ser 


Ser 




170 










175 






180 


Gly 


Glu 


lie Arg 


Thr 


Leu 


His 


Pro 


Phe 


Trp 


Glu 


Asp Val Thr 


Pro 






185 










190 






195 


Val 


Val 


Glu Leu 


Val 


Val 


Lys 


Val 


Thr 


Asp 


His 


Gly Lys Pro 


Thr 








200 








205 






210 


Leu 


Ser 


Ala Val 


Ala 


Lys 


Leu 


He 


He 


Arg 


Ser 


Val Ser Gly 


Ser 








215 








220 






225 


Leu 


Pro 


Glu Gly 


Val 


Pro 


Arg 


Val 


Asn 


Gly 


Glu 


Gin His His 


Trp 






230 








235 






240 


Asp 


Met 


Ser Leu 


Pro 


Leu 


He 


Val 


Thr 


Leu 


Ser 


Thr He Ser 


He 






245 










250 






255 


lie 


Leu 


Leu Ala 


Ala 


Met 


He 


Thr 


He 


Ala 


Val 


Lys Cys Lys 


Arg 








260 










265 




270 


Glu 


Asn 


Lys Glu 


He 


Arg 


Thr 


Tyr 


Asn 


Cys 


Arg 


He Ala Glu 


Tyr 






275 










280 






285 


Ser 


His 


Pro Gin 


Leu 


Gly 


Gly 


Gly 


Lys 


Gly 


Lys 


Lys Lys Lys 


He 








290 










295 






300 


Asn 


Lys 


Asn Asp 


lie 


Met 


Leu 


Val 


Gin 


Ser 


Glu 


Val Glu Glu 


Arg 




305 










310 






315 


Asn 


Ala 


Met Asn 


Val 


Met 


Asn 


Val 


Val 


Ser 


Ser 


Pro Ser Leu 


Ala 








320 










325 






330 


Thr 


Ser 


Pro Met 


Tyr 


Phe 


Asp 


Tyr 


Gin 


Thr 


Arg 


Leu Pro Leu 


Ser 








335 










340 






345 


Ser 


Pro 


Arg Ser 


Glu 


Val 


Met 


Tyr 


Leu 


Lys 


Pro 


Ala Ser Asn 


Asn 








350 








355 






360 


Leu 


Thr 


Val Pro 


Gin 


Gly 


His 


Ala 


Gly 


Cys 


His 


Thr Ser Phe 


Thr 








365 






370 






375 


Gly 


Gin 


Gly Thr 


Asn 


Ala 


Ser 


Glu 


Thr 


Pro 


Ala 


Thr Arg Met 


Ser 




380 










385 






390 


lie 


lie 


Gin Thr 


Asp 


Asn 


Phe 


Pro 


Ala 


Glu 


Pro 


Asn Tyr Met 


Gly 








395 










400 




405 


Ser 


Arg 


Gin Gin 


Phe 


Val 


Gin 


Ser 


He 


Ser 


Val 


Ala Pro Arg 


Leu 






410 










415 






420 


Arg 


Thr 


Gin Lys 


Glu 


Pro 


Ala 


















425 



















<210> 343 
<211> 144 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:982697 . l.orf 2:2000FEB18 
<220> 

<221> unsure ' 

<222> 52, 56 

<223> unknown or other 

<400> 343 

Gly Ser He Glu Gly Lys Cys Gly Val Gly Gly Ser Asn Arg Val 
15 10 15 



180/228 



WO 01/62927 



PCT/US01/06059 



Thr 


Ala 


Gly Ala 


Leu 


Pro 


Asn 


Gly 


Thr 


He 


Arg 


Ser Gly Pro 


Leu 






20 








25 






30 


Pro 


Ser 


Arg Pro 


Lys 


Asp 


Asp 


Arg 


Ser 


Thr 


Ser 


Ser Leu Tyr 


Ser 








35 










40 






45 


Ala 


Pro 


Gly Lys 


Ala 


Thr 


Xaa 


Thr 


Gin 


Leu 


Xaa 


Pro Met Ser 


Ala 








50 










55 






60 


Ala 


Leu 


Asp Ser 


Leu 


Pro 


Cys 


Lys 


Ala 


He 


Gly 


Ala Gly Leu 


Leu 








65 










70 






75 


Lys 


Ala 


Trp Gly 


Ala 


His 


Pro 


Leu 


Tyr 


Gin 


Cys 


Gly Leu Asp 


Val 








80 










85 






90 


Glu 


His 


Asp Val 


Glu 


Asp 


Tyr 


Phe 


Gly 


Thr 


Leu 


Arg Phe Ser 


Asp 








95 










100 






105 


Phe 


Pro 


Thr Gly 


Phe 


Trp 


Ser 


Cys 


Val 


Asp 


Pro 


Val Asp Pro 


Phe 








110 










115 






120 


Phe 


Tip 


Pro lie 


Ser 


Pro 


Phe 


Leu 


Gly 


Trp 


Lys 


His Leu Pro 


Asn 








125 










130 






135 


Pro 


Tyr 


Thr Pro 


He 


Val 


Ser 


Trp 


Lys 











140 

<210> 344 
<211> 97 
<212> PRT 

<213> Homo sapiens 



<221> misc_feature 

<223> Incyte ID No: LG:1080896 . 1 . orf2 :2000FEB18 
<400> 344 



Lys Lys 


Leu 


Val 


cys 


He 


Met 


Leu Phe 


Tyr 


Pro Arg Tyr Val 


Ala 


1 






5 








10 




15 


Gin Glu 


He 


Thr 


Asn 


Leu 


Met 


Arg Val 


Met 


Thr His Phe Lys 


Pro 








20 






25 




30 


Pro Met 


He 


Thr 


Val 


Phe 


Lys 


He Gin 


Leu 


Arg Thr Gly Pro 


Phe 








35 






40 




45 


Leu Gly 


Asp 


Thr Arg 


Lys 


Glu 


He Ala 


Arg 


Thr Glu Glu Lys 


Gly 








50 








55 




60 


Val Pro 


Lys 


Asn 


Val 


Thr 


Leu 


Lys Phe 


Asp 


Ala cys Ala Thr 


He 






65 






70 




75 


Asp Ser 


Lys 


Gin 


His 
80 


Gly 


He 


Gly Cys 


Gly 
85 


Ser Leu Asn Trp 


Lys 
90 


Lys Lys 


Leu 


His 


Ser 
95 


Arg 


Lys 











<210> 345 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 811341 . 1 .orf 3 :2000FEB01 



<400> 345 








Gly Leu Phe 


Gin Cys 


He 


His 


1 


5 






Val Ala Thr 


Val Leu 


Leu 


Phe 




20 






Thr Leu Lys 


He Thr 


Cys 


Leu 




35 






Cys Ser Pro 


Gly Ser 


Asp Gin 




50 






Glu Pro Pro 


Met Thr 


Thr 


Val 




65 







Gin Val Thr 


Glu Val Gly Gin 


Lys 


10 




15 


Tyr Gly Tyr 


Tyr Lys Cys Thr 


Gly 


25 




30 


Tyr Asn Val 


He Leu Tyr Lys 


Val 


40 




45 


Pro Asp Val 


Cys Tyr Asp Pro 


Ser 


55 




60 


Phe Lys He 


Arg Leu Arg Thr 


Glu 


70 




75 



<210> 346 



181/228 



WO 01/62927 



PCT7US01/06059 



<211> 135 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 903225 . 1 .orf 3 :2000FEB01 
<400> 346 



Asp 


Pro 


Phe 


Gin Lys 


Met Ala Pro Lys 


Val 


Lys 


Lys Glu Ala 


Pro 


1 






5 




10 






15 


Gly 


Pro 


Pro 


Lys Ala 


Glu Ala Lys Ala 


Lys 


Ala 


Leu Lys Ala 


Lys 








20 




25 






30 


Lys 


Val 


Val 


Leu Lys 


Gly Val His Gly 


His 


Lys 


Lys Lys Lys 


He 






35 




40 






45 


Arg 


Met 


Ser 


Pro Thr 


Phe Gin Arg Pro 


Lys 


Thr 


Leu Arg Leu 


Trp 






50 




55 






60 


Arg 


Pro 


Pro 


Arg Tyr 


Pro Arg Lys Thr 


Thr 


Pro 


Arg Arg Asn 


Lys 








65 




70 






75 


Leu 


Asp 


His 


Tyr Ala 


He He Lys Phe 


Pro 


Leu 


Thr Thr Glu 


Phe 






80 


85 






90 


Ala 


Met 


Lys 


Lys He 


Lys Asp Asn Asn 


Thr 


Leu 


Val Phe Thr 


Val 








95 




100 






105 


Asp 


Val 


Lys 


Ala Asn 


Lys His Gin He 


Lys 


Gin 


Ala Val Lys 


Lvs 






110 




115 






120 


Leu 


Cys 


Asp 


He Asp 


Gly Ala Lys Val 


Asn 


Thr 


Leu Met Glu 


Arg 




125 




130 






135 



<210> 347 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 242079 . 2 .orf 2 :2000FEB01 
<400> 347 

Ser Ala Arg He Glu Ala Trp Glu Pro Pro Thr Arg Phe Gin Ser 
15 10 15 

Met Cys Gly Lys Ala Trp Met Ser Arg Gin Lys Pro Ala Ala Gly 

20 25 30 

Thr Glu Pro Ser Trp Arg Thr Ser Thr Arg Val Val Trp Arg Gly 

35 40 45 

Asn Leu Gly Leu Glu Phe Pro His Ser Phe 

50 55 

<210> 348 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 979580.1. orf 2 :2000MAY19 
<400> 348 

Trp Lys Val Asn Gly Arg Asn Leu Ser Pro Phe Glu Glu He Gly 
1 5 10 15 

Asn Gin Ser His Phe Val Ala Gin Ala Gly Val Gin Trp His Asn 

20 25 30 

Leu Ala His Cys Asn His His Leu Pro Gly Ser Ser Asp Pro Pro 

35 40 45 

Thr Ser Thr Ser Gin Val Ala Gly Ser Ala Gly Val Arg His His 

50 55 60 

Thr Arg Leu He Phe Val Phe Leu Val Gin Lys Glu Phe His His 



182/228 



WO 01/62927 
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65 70 75 

Val Asp Gin Ala Gly Leu Lys Leu Leu Thr Ser Ser Asp Trp Pro 
80 85 90 

Thr Trp Ala Ser Gin Ser Ala Gly lie Thr Gly Val Ser His Cys 
95 100 105 

Ser Pro Ala Tyr Glu Val Val Phe Ala Val Lys Gin Gin Phe Gly 
110 115 120 

Asn Glu Ala Phe Leu Arg Ser Ser Val 
125 

<210> 349 
<211> 291 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 1169865 .l.orf 2 :2000MAY01 
<400> 349 



Arg 


Pro 


Thr 


Val 


Ser 


Val 


Ser Cys Ala 


Ser 


Ser 


Arg Pro Gin Phe 


1 








5 






10 




15 


Leu 


lie 


Thr 


Val 


Pro 


Val 


Leu Thr Val 


He 


Asn 


Tyr Arg Pro His 










20 






25 




30 


Asii 


Met 


Arg 


Pro 


Glu 


Asp 


Arg Met Phe 


His 


He 


Arg Ala Val He 










35 






40 




45 


Leu 


Arg 


Ala 


Leu 


Ser 


Leu 


Ala Phe Leu 


Leu 


Ser 


Leu Arg Gly Ala 










50 






55 




60 


Gly 


Ala 


He 


Lys 


Ala 


Asp 


His Val Ser 


Thr 


Tyr 


Ala Ala Phe Val 






65 






70 




75 


Gin 


Thr 


His 


Arg 


Pro 


Thr 


Gly Glu Phe 


Met 


Phe 


Glu Phe Asp Glu 










80 






85 




90 


Asp 


GlU 


Met 


Phe 


Tyr 


Val 


Asp Leu Asp 


Lys 


Lys 


Glu Thr Val Trp 










95 






100 




105 


His 


Leu 


Glu 


Glu 


Phe 


Gly 


Gin Ala Phe 


Ser 


Phe 


Glu Ala Gin Gly 










110 




115 




120 


Gly 


Leu 


Ala 


Asn 


He 


Ala 


He Leu Asn 


Asn 


Asn 


Leu Asn Thr Leu 








125 






130 




135 


lie 


Gin 


Arg 


Ser 


Asn 


His 


Thr Gin Ala 


Thr 


Asn 


Asp Pro Pro Glu 








140 






145 




150 


Val 


Thr 


Val 


Phe 


Pro 


Lys 


Glu Pro Val 


Glu 


Leu 


Gly Gin Pro Asn 










155 




160 




165 


Thr 


Leu 


He 


Cys 


His 


He 


Asp Lys Phe 


Phe 


Pro 


Pro Val Leu Asn 










170 




175 




180 


Val 


Thr 


Trp 


Leu 


Cys 


Asn 


Gly Glu Leu 


Val 


Thr 


Glu Gly Val Ala 










185 






190 




195 


Glu 


Ser 


Leu 


Phe 


Leu 


Pro 


Arg Thr Asp 


Tyr 


Ser 


Phe His Lys Phe 










200 






205 




210 


His 


Tyr 


Leu 


Thr 


Phe 


Val 


Pro Ser Ala 


Glu 


Asp 


Phe Tyr Asp Cys 










215 






220 




225 


Arg 


Val 


Glu 


His 


Trp 


Gly 


Leu Asp Gin 


Pro 


Leu 


Leu Lys His Trp 










230 






235 




240 


Glu 


Ala 


Gin 


Glu 


Pro 


He 


Gin Met Pro 


Glu 


Thr 


Thr Glu Thr Val 










245 






250 




255 


Leu 


Cys 


Ala 


Leu Gly 


Leu 


Val Leu Gly 


Leu 


Val 


Gly He He Val 










260 






265 




270 


Gly 


Ser 


Val 


Leu 


He 


He 


Lys Ser Leu 


Arg 


Ser 


Gly His Asp Pro 










275 






280 




285 


Arg 


Ala 


Gin 


Gly Thr 


Leu 


















290 













<210> 350 
<211> 517 
<212> PRT 
<213> Homo sapiens 

<220> 



183/228 



WO 01/62927 



PCTYUS01/06059 



<221> misc_feature 

<223> Incyte ID No: LG : 337818. 2. orf 1:2000FEB18 
<400> 350 



Arg 


Gly 


Ala 


Trp 


Glu 


Arg 


Arg 


Arg 


Ser 


Arg 


Pro 


Ala 


Glu Met Glu 


1 








5 










10 






15 


Ala 


Thr 


Gly 


Thr 


Trp 


Ala 


Leu 


Leu 


Leu 


Ala 


Leu 


Ala 


Leu Leu Leu 








20 










25 






30 


Leu 


Leu 


Thr 


Leu 


Ala 


Leu 


Ser 


Gly 


Thr 


Arg 


Ala 


Arg 


Gly His Leu 










35 










40 






45 


Pro 


Pro 


Gly 


Pro 


Thr 


Pro 


Leu 


Pro 


Leu 


Leu 


Gly 


Asn 


Leu Leu Gin 








50 










55 






60 


Leu 


Arg 


Pro 


Gly 


Ala 


Leu 


Tyr 


Ser 


Gly 


Leu 


Met 


Arg 


Leu Ser Lys 










65 










70 






75 


Lys 


Tyr 


Gly 


Pro 


Val 


Phe 


Thr 


lie 


Tyr 


Leu 


Gly 


Pro 


Trp Arg Pro 










80 










85 






90 


Val 


Val 


Val 


Leu 


Val 


Gly 


Gin 


Glu 


Ala 


Val 


Arg 


Glu 


Ala Leu Gly 










95 








100 






105 


Gly 


Gin 


Ala 


Glu 


Glu 


Phe 


Ser 


Gly 


Arg 


Gly 


Thr 


Val 


Ala Met Leu 










110 










115 






120 


Glu 


Gly 


Thr 


Phe 


Asp 


Gly 


His 


Gly 


Val 


Phe 


Phe 


Ser 


Asn Gly Glu 










125 










130 






135 


Arg 


Trp 


Arg 


Gin 


Leu 


Arg 


Lys 


Phe 


Thr 


Met 


Leu 


Ala 


Leu Arg Asp 










140 










145 






150 


Leu 


is xy 


Met 


Gly 


Lys 


Arg 


Glu 


Gly 


GlU 


Glu 


Leu 


He 


Gin Ala Glu 










155 










160 






165 


Ala 


Arg 


Cys 


Leu 


Val 


Glu 


Thr 


Phe 


Gin 


Gly 


Thr 


Glu 


Gly Arg Pro 










170 










175 






180 


Phe 


Asp 


Pro 


Ser 


Leu 


Leu 


Leu 


Ala 


Gin 


Ala 


Thr 


Ser 


Asn Val Val 








185 










190 






195 


Cys 


Ser 


Leu 


Leu 


Phe 


Gly 


Leu 


Arg 


Phe 


Ser 


Tyr 


Glu 


Asp Lys Glu 










200 










205 






210 


Phe 


Gin 


Ala 


Val 


Val 


Arg 


Ala 


Ala 


Gly 


Gly 


Thr 


Leu 


Leu Gly Val 










215 










220 






225 


Ser 


Ser 


Gin 


Gly 


Gly 


Gin 


Thr 


Tyr 


Glu 


Met 


Phe 


Ser 


Trp Phe Leu 










230 










235 






240 


Arg 


Pro 


Leu 


Pro 


Gly 


Pro 


His 


Lys 


Gin 


Leu 


Leu 


His 


His Val Ser 










245 










250 






255 


Thr 


Leu 


Ala 


Ala 


Phe 


Thr 


Val 


Arg 


Gin 


Val 


Gin 


Gin 


His Gin Gly 










260 










265 






270 


Asn 


Leu 


Asp 


Ala 


Ser 


Gly 


Pro 


Ala 


Arg 


Asp 


Leu 


Val 


Asp Ala Phe 










275 










280 






285 


Leu 


Leu 


Lys 


Met 


Ala 


Gin 


Glu 


Glu 


Gin 


Asn 


Pro 


Gly 


Thr Glu Phe 










290 










295 






300 


Thr 


Asn 


Lys 


Asn 


Met 


Leu 


Met 


Thr 


Val 


He 


Tyr 


Leu 


Leu Phe Ala 










305 










310 






315 


Gly 


Thr 


Met 


Thr 


Val 


Ser 


Thr 


Thr 


Val 


Gly 


Tyr 


Thr 


Leu Leu Leu 










320 










325 






330 


Leu 


Met 


Lys 


Tyr 


Pro 


His 


Val 


Gin 


Lys 


Trp 


Val 


Arg 


Glu Glu Leu 


* 








335 










340 






345 


Asn 


Arg 


Glu 


Leu 


Gly 


Ala 


Gly 


Gin 


Ala 


Pro 


Ser 


Leu 


Gly Asp Arg 










350 










355 






360 


Thr 


Arg 


Leu 


Pro 


Tyr 


Thr 


Asp 


Ala 


Val 


Leu 


His 


Glu 


Ala Gin Arg 










365 










370 






375 


Leu 


Leu 


Ala 


Leu 


Val 


Pro 


Met 


Gly 


lie 


Pro 


Arg 


Thr 


Leu Met Arg 










380 










385 






390 


Thr 


Thr 


Arg 


Phe 


Arg 


Gly 


Tyr 


Thr 


Leu 


Pro 


Gin 


Gly 


Thr Glu Val 










395 










400 






405 


Phe 


Pro 


Leu 


Leu 


Gly 


Ser 


lie 


Leu 


His 


Asp 


Pro 


Asn 


He Phe Lys 










410 










415 






420 


His 


Pro 


Glu 


Glu 


Phe 


Asn 


Pro 


Asp 


Arg 


Phe 


Leu 


Asp 


Ala Asp Gly 










425 










430 






435 


Arg 


Phe 


Arg 


Lys 


His 


Glu 


Ala 


Phe 


Leu 


Pro 


Phe 


Ser 


Leu Gly Lys 










440 










445 






450 


Arg 


Val 


Cys 


Leu 


Gly 


GlU 


Gly 


Leu 


Ala 


Lys 


Ala 


Glu 


Cys Phe Leu 










455 










460 






465 


Phe 


Phe 


Thr 


Thr 


lie 


Leu 


Gin 


Ala 


Phe 


Ser 


Leu 


Glu 


Ser Pro Cys 



184/228 



WO 01/62927 



PCT/US01/06059 



470 475 480 

Pro Pro Asp Thr Leu Ser Leu Lys Pro Thr Val Ser Gly Leu Phe 

485 490 495 

Asn lie Pro Pro Ala Phe Gin Leu Gin Val Arg Pro Thr Asp Leu 

500 505 510 
His Ser Thr Thr Gin Thr Arg 

515 



<210> 351 
<211> 232 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc__feature 
<223> Incyte ID No: LI: 

<400> 351 



Leu 


Ser 


Thr 


Asp 


He 


Asn He 


1 






5 




Ser 


Gin 


Val 


Thr 


His 


Gin Leu 










20 




Gly 


Val 


Trp 


Leu 


Ser 


Pro Ser 










35 




Val 


His 


Ser 


Ser 


His 


Ala Leu 










50 




Pro 


Cys 


Pro 


Trp cys 


Leu Ala 










65 




Gly 


His 


Ala 


Thr 


Gin 


Arg Leu 










80 




Ala 


Pro 


Ser 


Leu Gly 


Tyr Thr 










95 




Cys 


Pro 


His 


Thr 


Pro 


Asn His 










110 




Asp 


lie 


Val 


Leu 


Ala 


Pro Gin 










125 




Ala 


Pro 


Thr 


Glu 


Ala 


Gin Ser 










140 




Leu 


Ser 


His 


Pro 


Phe 


Leu Ser 










155 




Thr 


Ser 


Gly 


Ser 


Ala 


He Met 










170 




Thr 


Leu 


Arg 


Pro 


Glu 


Glu His 








185 




Cys 


Asn 


His 


Val 


Glu 


Ser Gly 










200 




Tyr 


Thr 


Leu 


Thr Asp 


Leu Gin 










215 




Thr 


Arg 


Pro 


Ser Gly 


Arg Gly 










230 





.l.orfl:2000FEB01 



Val His 


Leu 


Gin 


Ser Gin Ala 


His 




10 






15 


Pro Trp 


He 


Cys 


Ser Pro His 


Val 




25 






30 


Gin Ala. 


Thr 


Glu 


Thr Ala Thr 


His 




40 






45 


Ser He 


His 


Arg 


Thr Ser Gin 


Cys 




55 






60 


Gin Gly 


Thr 


Ala 


Cys Pro Leu 


Arg 




70 






75 


Ser Leu 


Ser 


Met 


Ala Pro Thr 


His 




85 






90 


Thr Leu 


Pro 


Ala 


Cys Asp His 


Arg 




100 






105 


Leu Ser 


Thr 


Gin 


Leu Pro Thr 


Tyr 




115 






120 


Ser He 


Phe 


Pro 


Leu Arg His 


Ala 




130 




135 


Pro Ala 


Thr 


Ser 


Ala Thr Ala 


Ala 




145 






150 


Thr Leu 


He 


Leu 


Pro Asn Ala 


Asn 




160 






165 


His Arg 


Asp 


Phe 


Gly His Thr 


Arg 




175 






180 


Leu Pro 


Asn 


Pro 


Asn Thr Cys 


Leu 




190 




195 


Pro Cys 


Cys 


Pro 


Ser Thr His 


Thr 




205 






210 


Pro Leu 


Phe 


Gly 


Val Arg Val 


Pro 




220 


225 



<210> 352 
<211> 220 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG:241577 . 4.orf 1:2000MAY19 
<400> 352 

Gly Lys Val Glu Val Glu Asp Glu Gly Cys Thr Ala Gin Lys Ala 
15 10 15 

Pro Val Arg Pro Gly Leu Leu Pro Pro Cys Leu Thr Pro Glu He 

20 25 30 

Gly Ala Gly Val Pro Ser Ala Gly Cys Pro Leu Cys Pro Ser Met 

35 40 45 



185/228 



WO 01/62927 



PCT/US01/06059 



Pro 


Pro 


Trp 


Ala 


Cys 


Ser 


Tyr 


His Thr 


Ser 


His 


Val Pro Met He 








50 








55 




60 


Pro 


Leu 


Leu 


Gly Pro 


Arg 


Ser 


Ser Phe 


Ser 


Arg 


Lys Trp Ser Ala 










65 








70 




75 


Arg 


Ala 


Arg 


Gly Gly 


Gly 


Lys 


Met Ser 


Pro 


Tyr 


Thr Asn Cys Tyr 










80 








85 




90 


Ala 


Gin 


Arg 


Tyr Tyr 


Pro 


Met 


Pro Glu 


Glu 


Pro 


Phe Cys Thr Glu 








95 








100 




105 


Leu 


Asn 


Ala 


Glu 


GlU 


Gin 


Ala 


Leu Lys 


Glu 


Lys 


Glu Lys Gly Ser 










110 








115 




120 


Trp 


Thr 


Gin 


Leu 


Thr 


His 


Ala 


Glu Lys 


Val 


Ala 


Leu Tyr Arg Leu 








125 








130 




135 


Gin 


Phe 


Asn 


Glu 


Thr 


Phe 


Ala 


Glu Met 


Asn 


Arg 


Arg Ser Asn Glu 










140 








145 




150 


Trp 


Lys 


Thr 


Val 


Met 


Gly 


Cys 


Val Phe 


Phe 


Phe 


He Gly Phe Ala 








155 








160 




165 


Ala 


Leu 


Val 


He 


Trp 


Trp 


Gin 


Arg Val 


Tyr 


Val 


Phe Pro Pro Lys 










170 








175 




180 


Pro 


lie 


Thr 


Leu 


Thr 


Asp 


Glu 


Arg Lys 


Ala 


Gin 


Gin Leu Gin Arg 










185 








190 




195 


Met 


Leu 


Asp 


Met 


Lys 


Val 


Asn 


Pro Val 


Gin 


Gly 


Leu Ala Ser Arg 








200 








205 




210 


Trp 


Asp 


Tyr 


Glu Lys 


Lys 


Gin 


Trp Lys 


Lys 







215 220 

<210> 353 

<211> 95 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:344786.4.orf 1:2000MAY19 
<400> 353 



Pro He 


Leu 


Trp 


Ser 


Val 


Leu Ser Phe 


Ser 


He 


Glu Leu Cys Phe 


1 




5 






10 




15 


Cys Cys 


Phe 


He 


Leu 


Ser 


Leu Leu Cys 


Val 


Phe 


Cys Pro He Leu 






20 






25 




30 


Cys Ser 


Arg 


His 


Gin 


Glu 


Pro Arg His 


Pro 


Pro 


Pro Val Thr Tyr 






35 






40 




45 


Ser Ser 


Glu 


Pro 


Ala 


Arg 


Ser Leu Phe 


Arg 


Met 


He Thr Trp Arg 








50 






55 




60 


Ser Leu 


Arg 


Lys 


Leu 


Leu 


Lys Asn Thr 


Leu 


Val 


Pro Ser Leu Ser 




65 




70 




75 


Gly Leu 


Gly 


Pro 


Phe 


Arg 


His Phe Ser 


Val 


Ser 


Met Thr Gin Thr 




80 






85 




90 



Met Gin Arg His Phe 
95 

<210> 354 
<211> 331 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:414307.1.orf2:2000FEB01 
<220> 

<221> unsure 
<222> 191 

<223> unknown or other 
<400> 354 

Gly Met Gly Lys Leu Cys Leu Gly Pro Thr Leu Cys Pro Ala Ala 
1.5 10 15 



186/228 



WO 01/62927 



PCTYUS01/06059 



Ala 


Ala 


Glu Glu 


Ser 


Arg 


Asp 


Ala 


Glu 


Pro 


Arg 


Arg Glu Leu 


Leu 








20 










25 






30 


Cys 


Ser 


Gly Arg 


Pro 


Trp 


Thr 


Trp 


Arg 


Ala 


Ala 


Ala Arg Val 


Thr 








35 










40 






45 


Thr 


Met 


He Pro 


Trp 


Val 


Leu 


Leu 


Ala 


Cys 


Ala 


Leu Pro cys 


Ala 








50 










55 






60 


Ala 


Asp 


Pro Leu 


Leu 


Gly 


Ala 


Phe 


Ala 


Arg 


Arg 


Asp Phe Arg 


Lys 








65 










70 






75 


Gly 


Ser 


Pro Gin 


Leu 


Val 


Cys 


Ser 


Leu 


Pro 


Gly 


Pro Gin Gly 


Pro 








80 










85 






90 


Pro 


Gly 


Pro Pro 


Gly 


Ala 


Pro 


Gly 


Pro 


Ser 


Gly 


Met Met Gly 


Arg 








95 










100 






105 


Met 


Gly 


Phe Pro 


Gly 


Lys 


Asp 


Gly 


Gin 


Asp 


Gly 


His Asp Gly 


Asp 








110 










115 






120 


Arg 


Gly 


Asp Ser 


Gly 


Glu 


Glu 


Gly 


Pro 


Pro 


Gly 


Arg Thr Gly 


Asn 








125 










130 






135 


Arg 


Gly 


Lys Pro 


Gly 


Pro 


Lys 


Gly 


Lys 


Ala 


Gly 


Ala lie Gly 


Arg 








140 










145 






150 


Ala 


Gly 


Pro Arg 


Gly 


Pro 


Lys 


Gly 


Val 


Asn 


Gly 


Thr Pro Gly 


Lys 








155 










160 






165 


His 


Gly 


Thr Pro 


Gly 


Lys 


Lys 


Gly 


Pro 


Lys 


Gly 


Lys Lys Gly 


Glu 








170 










175 






180 


Pro 


Gly 


Leu Pro 


Gly 


Pro 


Cys 


Ser 


Cys 


Gly 


Xaa 


Gly His Thr 


Lys 








185 










190 






195 


Ser 


Ala 


Phe Ser 


Val 


Ala 


Val 


Thr 


Lys 


Ser 


Tyr 


Pro Arg Glu 


Arg 








200 










205 






210 


Leu 


Pro 


He Lys 


Phe 


Asp 


Lys 


He 


Leu 


Met 


Asn 


Glu Gly Gly 


His 








215 










220 






225 


Tyr 


Asn 


Ala Ser 


Ser 


Gly 


Lys 


Phe 


Val 


Cys 


Gly 


Val Pro Gly 


He 








230 










235 






240 


Tyr 


Tyr 


Phe Thr 


Tyr 


Asp 


He 


Thr 


Leu 


Ala 


Asn 


Lys His Leu 


Ala 






245 








250 






255 


He 


Gly 


Leu Val 


His 


Asn 


Gly 


Gin 




Arg 


He 


Arg Thr Phe 


Asp 








260 










265 






270 


Ala 


Asn 


Thr Gly 


Asn 


His 


Asp 


Val 


Ala 


Ser 


Gly 


Ser Thr He 


Leu 






275 










280 




285 


Ala 


Leu 


Lys Gin 


Gly 


Asp 


Glu 


Val 


Trp 


Leu 


Gin 


He Phe Tyr 


Ser 








290 










295 






300 


Glu 


Gin 


Asn Gly 


Leu 


Phe 


Tyr 


Asp 


Pro 


Tyr 


Trp 


Thr Asp Ser 


Leu 








305 










310 






315 


Phe 


Thr 


Gly Phe 


Leu 


He 


Tyr 


Ala 


Asp 


Gin 


Asp 


Asp Pro Asn 


Glu 



320 325 330 

Val 



<210> 355 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 202943 . 2 .orf 3 :2000FEB01 
<400> 355 



Asn Val Arg 


Leu 


Gin 


Glu 


He Arg 


Lys 


Met 


Asp 


Ser Ser Gly 


Glu 


1 




5 








10 






15 


He Tyr Thr 


Val 


Gly 


Ala 


Glu Val 


Thr 


Phe 


Ser 


Cys Gin Glu 


Gly 






20 








25 




30 


Tyr Gin Leu 


Met 


Gly 


Val 


Thr Lys 


He 


Thr 


Cys 


Leu Glu Ser 


Gly 






35 






40 






45 


Glu Trp Asn 


His 


Leu 


He 


Pro Tyr 


Cys 


Lys 


Gly 


Met Phe Ser 


Lys 






50 








55 






60 


Phe Thr Thr 


Phe 


Leu 


Met 


Phe Gly 


Asn 


Pro 


Arg 


Lys Val Arg 


Arg 






65 






70 




75 


Arg His Met 


Lys 


Cys 


Tyr 


Val Val 


Ser 


Met 


Phe 


Phe Val Phe 


Glu 






80 






85 






90 



187/228 



WO 01/62927 



PCT/US01/06059 



Leu Arg Tyr 



<210> 356 
<211> 112 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LI: 246194 .2 .orf 1:2000FEB01 



<220> 

<221> unsure 

<222> 25, 28, 52, 112 

<223> unknown or other 



<400> 356 








Ser 


Ser 


Thr 


His 


His 


Arg Arg Ser 


1 








5 




Leu 


Pro 


Pro 


Pro 


Pro 


Pro Ala Thr 










20 




Ser 


Ala 


Ala 


Pro Gly 


Pro Gly Ala 










35 




Fto 


Ala 


Ser 


Thr 


xnr 


cys Xaa Val 










50 




Ser 


Gly 


Ser 


Thr 


Ala 


Arg Ser Asp 










65 




Gly 


Pro 


Gly 


Pro 


Pro 


Ala Gly Pro 










80 




Arg 


His 


Pro 


His 


Gly 


Ser His Leu 








95 




Phe 


Leu 


Phe 


Asn 


Thr 


Lys Xaa 










110 



Thr 


Gly 


Ala 


Pro Thr Pro Gly 




10 




15 


Arg 


Xaa 


Ser 


Cys Xaa Ala Ala 




25 




30 


Ala 


Pro 


Val 


Gly Ala Pro Thr 




40 




45 


Leu 


Ala 


Arg 


Thr Thr Ser Pro 




55 




60 


Ala 


Ala 


Glu 


Arg Gly Ser Pro 




70 




75 


Ala 


Ala 


Gin 


Arg His Gly Gly 




85 




90 


Asn 


Pro 


Gin 


His Pro lie Lys 




100 




105 



<210> 357 

<211> 73 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 815961 . 1 .orf 3 :2000FEB01 



<400> 357 

Cys Val Glu Glu Val Cys Glu Cys Lys Asp Val Glu Val Leu Pro 
15 10 15 

Val Leu Asn Glu Leu Ala Asp Gly Leu Val Pro Leu Val Val Thr 

20 25 30 

Val He Gly Gly Ala Val Trp Val Asp Pro Val Thr Leu Ser Val 

35 40 45 

Val Ser Gly Gly Met Val Pro Val Gly Val Glu Trp Met Glu Ala 

50 55 60 

Glu Val Asp He Cys Ala Trp Val Gly Val Met Thr Leu 

65 70 



<210> 358 
<211> 239 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG:120744 . 1 .orf 1 .-2000MAY19 



<220> 

<221> unsure 



188/228 



WO 01/62927 



PCT/US01/06059 



<222> 199 

<223> unknown or other 
<400> 358 



Asn 


Glu 


Asn 


Arg 


Asn 


Met 


Ala 


His 


Tyr 


He 


Thr 


Phe Leu Cys Met 


1 








5 










10 




15 


Val 


Leu 


Val 


Leu 


Leu 


Leu 


Gin 


Asn 


Ser 


Val 


Leu 


Ala Glu Asp Gly 










20 










25 




30 


Glu 


Val 


Arg 


Ser 


Ser 


Cys 


Arg 


Thr 


Ala 


Pro 


Thr 


Asp Leu Val Phe 










35 








40 




45 


lie 


Leu 


Asp 


Gly Ser 


Tyr 


Ser 


Val 


Gly 


Pro 


Glu 


Asn Phe Glu He 










50 










55 




60 


Val 


Lys 


Lys 


Trp 


Leu 


Val 


Asn 


He 


Thr 


Lys 


Asn 


Phe Asp He Gly 










65 










70 




75 


Pro 


Lys 


Phe 


He 


Gin 


Val 


Gly 


Val 


Val 


Gin 


Tyr 


Ser Asp Tyr Pro 










80 










85 




90 


Val 


Leu 


Glu 


He 


Pro 


Leu 


Gly 


Ser 


Tyr 


Asp 


Ser 


Gly Glu His Leu 










95 










100 




105 


Thr 


Ala 


Ala 


Val 


Glu 


Ser 


He 


Leu 


Tyr 


Leu 


Gly 


Gly Asn Thr Lys 










110 










115 




120 


Thr 


Gly 


Lys 


Ala 


He 


Gin 


Phe 


Ala 


Leu 


Asp 


Tyr 


Leu Phe Ala Lys 










125 










130 




135 


Ser 


Ser 


Arg 


Phe 


Leu 


Thr 


Lys 


He 


Ala 


Val 


Val 


Leu Thr Asp Gly 










140 










145 




150 


Lys 


Ser 


Gin 


Asp Asp 


Vai 


Lys 


Asp 


Ala 


Ala 


Gin 


Ala Ala Arg Asp 










155 










160 




165 


Ser 


Lys 


He 


Thr 


Leu 


Phe 


Ala 


He 


Gly 


Val 


Gly 


Ser Glu Thr Glu 








170 








175 


180 


Asp 


Ala 


Glu 


Leu Arg 


Ala 


He 


Ala 


Asn 


Lys 


Pro 


Ser Ser Thr Tyr 








185 










190 




195 


Val 


Phe 


Tyr 


Xaa 


Glu 


Asp 


Tyr 


He 


Ala 


He 


Ser 


Lys He Arg Glu 










200 










205 




210 


Val 


Met 


Lys 


Gin 


Lys 


Leu 


Cys 


Glu 


Glu 


Ser 


Val 


Cys Pro Thr Arg 










215 










220 




225 


He 


Pro 


Val 


Ala 


Ala 


Arg 


Asp 


Glu 


Arg 


Gly 


Phe 


Asp He Leu 










230 










235 







<210> 359 
<211> 528 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 757520. 1 . orf 1 : 2000MAY01 
<400> 359 



Ala 


Ser 


His 


Ser 


Ala 


Ala 


Arg 


Thr 


Thr 


Phe 


Arg 


Cys Leu Ser Asp 


1 








5 










10 




15 


Ser 


Ala 


Gin 


Pro 


His 


Thr 


Met 


Ser 


Cys 


Arg 


Ser 


Tyr Arg He Ser 










20 








25 




30 


Ser 


Gly 


Cys 


Gly 


Val 


Thr 


Arg 


Asn 


Phe 


Ser 


Ser 


Cys Ser Ala Val 










35 










40 




45 


Ala 


Pro 


Lys 


Thr 


Gly 


Asn 


Arg 


Cys 


Cys 


He 


Ser 


Ala Ala Pro Tyr 










50 










55 




60 


Arg 


Gly 


Val 


Ser 


Cys 


Tyr 


Arg 


Gly 


Leu 


Thr 


Gly 


Phe Gly Ser Arg 










65 










70 




75 


Ser 


Leu 


Cys 


Asn 


Leu 


Gly 


Ser 


Cys 


Gly 


Pro 


Arg 


He Ala Val Gly 










80 










85 




90 


Gly 


Phe 


Arg 


Ala 


Gly 


Ser 


Cys 


Gly 


Arg 


Ser 


Phe 


Gly Tyr Arg Ser 










95 










100 




105 


Gly 


Gly 


Val 


Cys 


Gly 


Pro 


Ser 


Pro 


Pro 


Cys 


He 


Thr Thr Val Ser 










110 










115 




120 


Val 


Asn 


Glu 


Ser 


Leu 


Leu 


Thr 


Pro 


Leu 


Asn 


Leu 


Glu He Asp Pro 










125 










130 




135 


Asn 


Ala 


Gin 


Cys 


Val 


Lys 


Gin 


Glu 


Glu 


Lys 


Glu 


Gin He Lys Ser 



140 145 150 



189/228 



WO 01/62927 



PCT/US01/06059 



Leu 


Asn 


Ser 


Arg 


Phe 
155 


Ala 


Ala 


Glu 


Gin 


Gin 


Asn 


Lys 
170 


Leu 


Leu 


Asn 


Gin 


Arg 


Cys 


cys 

185 


Glu 


Ser 


Tyr 


He 


Glu 


Thr 


Leu 
200 


Arg 


Arg 


Ser 


Gly 


Arg 


Leu 


Ala 
215 


Ser 


Glu 


Glu 


Gly 


Tyr 


Lys 


Lys 
230 


Lys 


Tyr 


Thr 


Ala 


Glu 


Asn 


Glu 
245 


Phe 


Val 


Ala 


Tyr 


Leu 


Arg 


Lys 
260 


Ser 


Asp 


Val 


Glu 


Glu 


Ser 


Ser 
275 


Phe 


Leu 


Arg 


Val 


Leu 


Gin 


Ala 
290 


His 


He 


Met 


Asp 


Asn 


Ser 


Arg 
305 


Asp 


Leu 


He 


Lys 


Ala 


Gin 


Tyr 
320 


Asp 


Asp 


TV 1 - 

nxa 


Glu 


Ser 


Trp 


Tyr 
335 


Arg 


Ser 


Val 


He 


Arg 


His 


Gly 
350 


Glu 


Thr 


Asn 


Glu 


Leu 


Asn 


Arg 
365 


Met 


He 


Asn 


Ala 


Lys 


Cys 


Gin 
380 


Arg 


Ala 


Ala 


Glu 


Gin 


Gin 


Gly 
395 


Glu 


Ala 


Leu 


Ala 


Glu 


Leu 


Glu 


Gly 


Ala 










410 




Ala 


Cys 


Leu 


Leu 


Lys 
425 


Glu 


Tyr 


Gly 


Leu 


Asp 


He 


Glu 
440 


He 


Ala 


Glu 


Glu 


His 


Arg 


Leu 
455 


Cys 


Glu 


Val 


Ser 


Ser 


Ser 


Arg 
470 


Gly 


Gly 


Ser 


Thr 


Thr 


Pro 


Gly 
485 


Arg 


Gin 


Gly 


Gly 


Ser 


He 


Thr 
500 


Val 


Val 


Gin 


Pro 


Arg 


Ser 


Ser 
515 


Ser 


Phe 


Arg 


Phe 


Ala 











Phe 


He 


Asp 


Lys 


Val Arg Phe 


Leu 






160 






165 


Glu 


Thr 


Lys 


Trp 


Gin Phe Tyr 


Gin 






175 






180 


Asn 


Leu 


Glu 


Pro 


Leu Phe Ser 


Gly 






190 






195 


Glu 


Ala 


Glu 


Cys 


Val Glu Ala 


Asp 






205 




210 


Leu 


Asn 


His 


Val 


Gin Glu Val 


Leu 






220 






225 


Glu 


GlU 


Glu 


Val 


Ala Leu Arg 


Ala 






235 




240 


Val 


Leu 


Lys 


Lys 


Asp Val Asp 


Cys 






250 






255 


Leu 


Glu 


Ala 


Asn 


Val Glu Ala 


Leu 






265 






270 


Arg 


Arg 


Leu 


Tyr 


Glu Glu Glu 


He 






280 






285 


Ser 


Asp 


Thr 


Ser 


Val He Val 


Lys 






295 






300 


Asn 


Met 


Asp 


Cys 


He He Ala 


Glu 






310 






315 


Val 


Ala 


Ser 


Arg 


Ser Arg Ala 


Glu 






325 






330 


Lys 


Cys 


Glu 


Glu 


Met Lys Ala 


Thr 






340 






345 


Leu 


Arg 


Arg 


Thr 


Lys Glu Glu 


He 






355 






360 


Gin 


Arg 


Leu 


Thr 


Ala Glu He 


Glu 






370 






375 


Lys 


Leu 


Glu 


Ala 


Ala Val Ala 


Glu 




385 






390 


Ala 


Leu 


Ser 


Asp 


Ala Arg Cys 


Lys 






400 






405 


Leu 


Gin 


Lys 


Ala 


Lys Gin Asp 


Met 






415 






420 


Gin 


Glu 


Val 


Met 


Asn Ser Lys 


Leu 






430 






435 


Thr 


Tyr 


Arg 


Arg 


Leu Leu Glu 


Gly 






445 






450 


Gly 


Val 


Gly 


Ser 


Val Asn Val 


Cys 






460 






465 


Val 


Ser 


Cys 


Gly 


Gly Leu Ser 


Tyr 






475 






480 


He 


Thr 


Ser 


Gly 


Pro Ser Ala 


He 






490 




495 


Ala 


Pro 


Asp 


Ser 


Cys Ala Pro 


Cys 






505 






510 


Ser 


Cys 


Gly 


Ser 


Ser Arg Ser 


Val 






520 






525 



<210> 360 
<211> 157 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 160570. l.orf 2 :2000FEB18 
<220> 

<221> unsure 
<222> 150 

<223> unknown or other 



190/228 
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<400> 360 



Val 


Glu 


Gly Ala Ala 


Gly 


Arg 


1 






5 






Leu 


Glu 


Thr Thr Thr 


Thr 


Asp 








20 




Glu 


GlU 


Phe Tyr 


Cys 


Glu 


Val 








35 






Thr 


Leu 


Thr Thr 


Gin 


Gin 


Gin 








50 






Ser 


Val 


Val Leu 


Asp 


His 


Val 








65 






He 


Asp 


Tyr Phe 


Gly 


Leu 


Arg 








80 






Tyr 


Trp 


Leu Asp 


Pro 


Ala 


Lys 








95 






He 


Asn 


Thr Gly 


Pro 


Pro 


Tyr 








110 




Tyr 


Ala 


Glu Asp 


Pro 


Cys 


Lys 








125 






Ser 


He 


Asp Phe 


Val 


Phe 


Glu 






140 






Lys 


Ala 


Leu Phe 


Lys 


Thr 


Asn 



155 



Leu Gly 


Gly 


Ser 


Phe Leu Ser Trp 




10 




15 


Met Gly 


cys 


Phe 


Cys Ala Val Pro 




25 




30 


Leu Leu 


Leu 


Asp 


Glu Ser Lys Leu 




40 




45 


Gly He 


Lys 


Lys 


Ser Thr Lys Gly 




55 




60 


Phe His 


His 


Val 


Asn Leu Val Glu 




70 




75 


Tyr Cys 


Asp 


Arg 


Ser His Gin Thr 




85 




90 


Thr Leu 


Ala 


Glu 


His Lys Glu Leu 




100 




105 


Thr Leu 


Tyr 


Phe 


Gly He Lys Phe 




115 




120 


Leu Lys 


Glu 


Glu 


He Thr Arg Tyr 




130 




135 


Gin He 


His 


Ala 


Leu Arg He Xaa 




145 




150 



<21G> 361 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI:350398.3 ,orf3 :2000FEB01 



<220> 

<221> unsure 

<222> 22-23, 56, 65 

<223> unknown or other 



<400> 361 

Gly Leu Pro Lys Pro Gly Ala Leu Val Gly Asp Arg Ala Ala Pro 

15 10 15 

Ala Trp Val Gin Pro Pro Xaa Xaa Gin Val Asn Arg Phe His Lys 

20 25 30 

He Arg Asn Arg Ala Leu Leu Leu Thr Asp Gin His Leu Tyr Lys 

35 40 45 

Leu Asp Pro Asp Arg Gin Tyr Arg Val Met Xaa Ala Val Pro Leu 

50 55 60 
Glu Ala Val Thr Xaa 

65 



<210> 362 
<211> 517 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc__feature 

<223> Incyte ID No: LI:221285. 1 .orf 3 :2000FEB01 
<400> 362 

Leu Ala Ala Arg Gly Val Leu Ser Arg Gly Gin Pro Gly Ser Ala 
15 10 15 

Ala Ala Pro Arg Gin Glu Lys Gin Pro Arg Thr Pro Trp Lys Ser 

20 * 25 30 

Arg Ser Thr Trp Ala Cys Glu Asn Gly Ala Asn Thr Ser Pro Ala 

35 40 45 

Lys Pro His Ser Lys Ala Gly Pro Arg Thr Ala Thr Val Ala Pro 
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Gin 


Ala 


Ser Arg 


Met 


Thr 


Val 








65 






Ala 


Arg 


Asn Leu 


Ala 


Lys 


Arg 








80 






Ala 


Gly 


Arg Ser 


Gin 


GlU 


Pro 








95 






Pro 


Gly 


Lys Pro 


Gly 


Asp 


Arg 








110 






Asp 


Arg 


Thr Leu 


Gly 


cys 


Trp 








125 






Gly 


Leu 


Cys Gly 


Ala 


Gly 


Ser 








140 






Cys 


Ala 


Gly Asn 


Leu 


Pro 


Ser 






155 






Leu 


Leu 


Ala Ser 


Arg 


Asn 


Pro 








170 






Ser 


Gly 


Ser His 


Asn 


Thr 


Leu 






185 






Leu 


His 


Arg Lys 


Arg 


Gly 


Ser 








200 






Leu 


Thr 


Asp Arg 


Thr 


Ser 


Arg 








215 






Ser 


Arcr 

S3 


Pro Arg 


Ala 


Leu 


Pro 








230 






Ser 


Leu 


Thr Gin 


Pro 


Asd 


Glu 








245 




Met 


Tyr 


Asp Arg 


Arcr 


Val 


He 








260 






Thr 


Gly 


Pro Leu 


Leu 


Gly 


Arcr 








275 






Gin 


Arg 


Glu Ala 


Arg 


Lys 


Arg 








290 






Glu 


Gin 


Leu Arg 


Pro 


Gin 


Thr 






305 






His 


Val 


Asp Val 


Gin 


Thr 


Glu 








-320 






Arg 


lie 


He Glu 


Val 


Asp 


Met 








335 






Asp 


Arg 


Pro Pro 


Thr 


Pro 


Leu 






350 






Asp 


Val 


Ala Thr 


Gin 


lie 


Leu 






3 65 






Leu 


Glu 


Val Lys 


Pro 


Val 


Leu 






380 






Glu 


Gin 


Ser Leu 


Leu 


Glu 


Val 








395 






Leu 


Arg 


Ala Ser 


Gin 


Arg 


Glu 








410 






Arg 


Ala 


Glu Val 


Gin 


Arg 


Leu 








425 






Glu 


Glu 


Lys Glu 


Arg 


Arg 


Lys 








440 






Lys 


His 


Asn Glu 


Thr 


Ser 


Gin 






455 






Gin 


Arg 


Tyr Leu 


Ala 


Asp 


Leu 








470 






Arg 


Asp 


Ser Gly 


Tyr 


Phe 


Tyr 








485 






lie 


Gly 


Phe Leu 


Pro 


Trp 


Leu 






500 




Glu 


Tyr 


Ser Met 


Val 


Gly 


Arg 



515 

<210> 363 
<211> 60 
<212> PRT 







55 




60 


LVS 


Pro 


Ala 


Lys Ala Ala Ser 


Leu 




70 




75 


Arg 


Arcr 


Thr 


Tyr Leu Gly Gly 


Ala 






85 




90 


Glu 


Val 


Pro 


Cys Ala Ala Val 


Leu 






100 




105 


Asn 


Cys 


Pro 


Glu Phe Pro Pro 


Pro 




115 




120 


Ala 


Thr 


Asp 


Ala Ala Pro Ala 


Ala 






130 




135 


Glu 


Pro 


Ser 


He Ala Pro Thr 


Ser 






145 




150 


Arg 


Pro 


Pro 


Pro Leu Leu Ser 


Pro 




160 




165 


Cvs 


Pro 


Trp 


His Tyr Leu His 


Leu 






175 




180 


Ala 


Pro 


Thr 


Cys Phe Lys Ala 


Lys 






190 




195 


Gin 


Pro 


Pro 


Asp Met Ala Ser 


Ala 






205 


210 


Ala 


Pro 


Ser 


Thr Tyr Thr Tyr 


Thr 






220 


225 


Cvs 


Gin 


Arg 


Ser Arg Tvr Arg 


AST) 






235 




240 


GlU 


Pro 


Met 


His Tyr Gly Asn 


He 






250 


255 


Arg 


Glv 


Asn 


Thr Tyr Ala Leu 


Gin 






265 




270 


Pro 




Ser 


Leu Glu Leu Gin 


Arcr 




280 




285 


Ala 


Leu 


Ala 


Arg Lys Gin Ala 


Gin 






295 




300 


Pro 


Glu 


Pro 


Val Glu Gly Arg 


Lys 






310 




315 


Leu 


Tyr 


Leu 


Glu Glu He Ala 


Asp 




325 




330 


Glu 


Cys 


Gin 


Thr Asp Ala Phe 


Leu 




340 




345 


Phe 


He 


Pro 


Ala Lys Thr Gly 


Lys 






355 




360 


Glu 


Gly 


Glu 


Leu Phe Asp Phe 


Asp 




370 




375 


Glu 


Val 


Leu 


Val Gly Lys'* Thr 


lie 






385 


390 


Met 


Glu 


Glu 


Glu Glu Leu Ala 


Asn 






400 




405 


Tyr 


Glu 


Glu 


Leu Arg Asn Ser 


Glu 




415 




420 


Glu 


Glu 


Gin 


Glu Arg Arg His 


Arg 






430 




435 


Lys 


Gin 


Gin 


Trp Glu He Met 


His 




445 


450 


Lys 


He 


Ala 


Ala Arg Ala Phe 


Ala 






460 




465 


Leu 


Pro 


Ser 


Val Phe Gly Ser 


Leu 






475 




480 


Asp 


Pro 


He 


Glu Arg Asp He 


Glu 






490 




495 


Met 


Asn 


Glu 


Val Glu Lys Thr 


Met 






505 


510 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 401605 .2 . orf 2 :2000FEB01 
<400> 363 

Ala Glu Arg Gly Leu Arg Thr Leu Leu Ser Leu Glu Asp Glu Arg 
15 10 15 

Met Cys His Ser Gly Gin Met Gly Ser Leu Leu Gly Thr Val Cys 

20 25 30 

Ser Glu Ser Val Pro Ser Thr Pro Lys Lys Pro Pro Lys Ser Trp 

35 40 45 

Ala Ser Leu Trp Asn Gin Gin He Leu His Phe Gly Ala Tyr Lys 

50 55 60 



<210> 364 
<211> 239 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:329017. 1 .orf 1:200 0FEB01 
<400> 364 

He Tyr Thr Glu Val Glu Gin Leu Gly Trp Lys Leu Tyr Gly Asp 

15 10 15 

Lys Leu Ala Thr Ser Ser Gly Asp Thr Thr Val Lys Leu Trp Asp 

20 25 30 

Leu Cys Thr Gly Asp Cys He Leu Thr Phe Glu Gly His Ser Arg 

35 40 45 

Ala Val Trp Ser Cys Thr Trp His Ser Cys Gly Asn Phe Val Ala 

50 55 60 

Ser Ser Ser Leu Asp Lys Thr Ser Lys He Trp Asp Val Asn Ser 

65 70 75 

Glu Arg Cys Arg Cys Thr Leu Tyr Gly His Thr Asp Ser Val Asn 

80 85 90 

Ser He Glu Phe Phe Pro Phe Ser Asn Thr Leu Leu Thr Ser Ser 

95 100 105 

Ala Asp Lys Thr Leu Ser He Trp Asp Ala Arg Thr Gly He Cys 

110 115 120 

Glu Gin Ser Leu Tyr Gly His Met His Ser He Asn Asp Ala He 

125 130 135 

Phe Asp Pro Arg Gly His Met He Ala Ser Cys Asp Ala Cys Gly 

140 145 150 

Val Thr Lys Leu Trp Asp Phe Arg Lys Leu Leu Pro He Val Ser 

155 160 165 

He Asp He Gly Pro Ser Pro Gly Asn Glu Val Asn Phe Asp Ser 

170 175 180 

Ser Gly Arg Val Leu Ala Gin Ala Ser Gly Asn Gly Val He His 

185 190 195 
Leu Leu Asp Leu Lys Ser Gly Glu He His Lys Leu Met Gly His 

200 205 210 
Glu Asn Glu Ala His Thr Val Val Phe Ser His Asp Gly Glu He 

215 220 225 
Leu Phe Ser Gly Gly Ser Asp Gly Thr Val Arg Thr Trp Ser 
230 235 

<210> 365 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 



193/228 



WO 01/62927 



PCTAJS01/06059 



<223> Incyte ID No: LI : 401322 . l.orf 1:2Q00FEB01 
<400> 365 



Ala 


Thr 


Gin 


Trp Glu 


Ser 


Pro 


Pro Gly 


Asp 


Thr 


Thr Gly Gly 


Ser 


1 






5 








10 






15 


Pro 


Gly 


He 


Tyr Lys 


Val 


Pro 


Pro Ala 


Thr 


Ala 


Arg Trp Asp 


Ser 








20 








25 






30 


Trp 


cys 


Cys 


Trp Arg 


Pro 


Val 


Trp Cys 


Leu 


Gin 


Ser Ala Thr 


Leu 








35 








40 






45 


Gin 


Leu 


Gin 


Leu Leu 


Asp 


Gin 


Pro Cys 


Val 


Ser 


Ala Ser Pro 


Ser 








50 






55 






60 


Met 


Trp 


Ala 


Arg Pro 


Glu 


Cys 


Arg Trp 


Ala 


Met 


Pro Ala Gly 


Ser 








65 








70 






75 


Cys 


Thr 


Ala 


Trp Ser 


Thr 


Thr 


Ser Ser 


Pro 


Val 


Ala Pro cys 


Pro 






80 








85 






90 


Ala 


Thr 


Arg 


Pro Trp 


Gly 


Ala 


Val He 


Thr 


Pro 


Ser Thr Pro 


Ser 






•i 95 






100 






105 


Ser 


Gly 


Arg 


Pro Ser 


Leu 


Ala 


Gly Met 


Cys 


Pro 


Gly Leu Ser 


Val 








110 








115 






120 


Asp 


Leu 


Glu 


Pro Ala 


Val 


He 


Gly Trp 


His 


Gin 


Leu Pro Val 


Pro 






125 








130 






135 


His 


Ser 


Gly 


Ala Arg 


Gly 


Cys 


Cys Ser 


Gin 


Gly 


Ala Ala Gly 


Ser 








140 








145 






150 


Leu 


Arg 


Ala 


Lys Gin 


Tyr 


His 


Ser His 


His 









155 160 

<210> 366 
<211> 757 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No:- LG:403409 . 1 ,orf3 : 2000MAY19 
<400> 366 



Gly 


Arg 


Gly 


Arg Arg 


Lys 


Pro 


Asn Glu 


Phe 


Leu 


Gly Gly Cys 


Arg 


1 








5 








10 






15 


Met 


Gly 


Asp 


Ser 


Lys 


Val 


Lys 


Val Ala 


Val 


Arg 


He Arg Pro 


Met 










20 








25 






30 


Asn 


Arg 


Arg 


Glu 


Thr 


Asp 


Leu 


His Thr 


Lys 


Cys 


Val Val Asp 


Val 










35 








40 






45 


Asp 


Ala 


Asn 


Lys 


Val 


He 


Leu 


Asn Pro 


Val 


Asn 


Thr Asn Leu 


Ser 








50 








55 






60 


Lys 


Gly 


Asp 


Ala Arg 


Gly 


Gin 


Pro Lys 


Val 


Phe 


Ala Tyr Asp 


His 










65 








70 






75 


Cys 


Phe 


Trp 


Ser 


Wet 


Asp 


Glu 


Ser Val 


Lys 


Glu 


Lys Tyr Ala 


Gly 










80 








85 






90 


Gin 


Asp 


He 


Val 


Phe 


Lys 


Cys 


Leu Gly 


Glu 


Asn 


He Leu Gin 


Asn 








95 






100 






105 


Ala 


Phe 


Asp 


Gly 


Tyr 


Asn 


Ala 


Cys He 


Phe 


Ala 


Tyr Gly Gin 


Thr 










110 








115 






120 


Gly 


Ser 


Gly 


Lys 


Ser 


Tyr 


Thr 


Met Met 


Gly 


Thr 


Ala Asp Gin 


Pro 










125 








130 






135 


Gly 


Leu 


He 


Pro 


Arg 


Leu 


Cys 


Ser Gly 


Leu 


Phe 


Glu Arg Thr 


Gin 










140 








145 






150 


Lys 


Glu 


Glu 


Asn 


Glu 


Glu 


Gin 


Ser Phe 


Lys 


Val 


Glu Val Ser 


Tyx 










155 








160 






165 


Met 


GlU 


He 


Tyr 


Asn 


Glu 


Lys 


Val Arg 


Asp 


Leu 


Leu Asp Pro 


Lys 










170 








175 






180 


Gly 


Ser 


Arg 


Gin 


Thr 


Leu 


Lys 


Val Arg 


Glu 


His 


Ser Val Leu 


Gly 










185 






190 






195 


Pro 


Tyr 


Val 


Asp 


Gly 


Leu 


Ser 


Lys Leu 


Ala 


Val 


Thr Ser Tyr 


Lys 










200 








205 






210 


Asp 


He 


Glu 


Ser 


Leu 


Met 


Ser 


Glu Gly 


Asn 


Lys 


Ser Arg Thr 


Val 










215 








220 






225 


Ala 


Ala 


Thr 


Asn 


Met 


Asn 


Glu 


Glu Ser 


Ser 


Arg 


Ser His Ala 


Val 
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230 



Phe 


Lys 


He Thr 


Leu 


Thr 


His 






245 






Thr 


Ser 


Gly Glu 


Lys 


Val 


Gly 






260 






Gly 


Ser 


Glu Arg 


Ala 


Thr 


Lys 






275 






Lys 


Glu 


Gly Ser 


Asn 


He 


Asn 






290 






Val 


He 


Ser Ala 


Leu 


Ala 


Asp 








305 






Lys 


Phe 


Val Pro 


Tyr 


Arg 


Asp 








320 






Asp 


Ser 


Leu Gly 


Gly 


Asn 


Ser 








335 






Ser 


Pro 


Ala Ala 


Asp 


Asn 


Tyr 








350 






Tyr 


Ala 


Asp Arg 


Ala 


Lys 


His 








365 






Glu 


Asp 


Pro Asn 


Ala 


Arg 


lie 








380 






Glu 


Lys 


Leu Arg 


Glu 


Gin 


Leu 








395 






Pro 


Glu 


Leu Lys 


Asp 


Arg 


Leu 








410 






Glu 


Met 


Thr Val 


Thr 


Trp 


Glu 








425 




He 


Ala 


Gin Glu 


Arg 


Gin 


Lys 








440 




Leu 


Gin 


Ser Ser 


Gly 


He 


Lys 








455 






Val 


Asn 


Leu Asn 


Ala 


Asp 


Pro 








470 




Tyr 


Leu 


Lys Glu 


His 


Thr 


Leu 








485 






He 


Gin 


Leu cys 


Gly 


Met 


Gly 






500 




Asp 


He 


Thr Ser 


Glu 


Gly 


Gin 






515 






Thr 


Arg 


Thr Phe 


Val 


Asn 


Gly 








530 




Leu 


His 


His Gly 


Asp 


Arg 


He 








545 






Arg 


Leu 


Asn Leu 


Pro 


Lys 


Lys 








560 






Glu 


Asp 


Gin Asp 


Pro 


Ser 


Met 






575 






Leu 


Asp 


Val Asp 


Gly 


Asp 


Ser 








590 






Asn 


Phe 


Asn Tyr 


Glu 


Tyr 


Ala 








605 






Leu 


Gly 


Ser Asn 


Asp 


Pro 


Met 








620 






Gin 


Gin 


His Glu 


Glu 


Glu 


Lys 








635 




Leu 


Met 


Tyr Glu 


His 


Glu 


Leu 






650 






Pro 


Glu 


Lys Gin 


Asn 


Cys 


Arg 








665 






Ser 


Pro 


Ser Ala 


Gin 


Gin 


Arg 








680 




Glu 


Ala 


Thr Leu 


'Asn 


Asn 


Ser 








695 






Val 


Lys 


Ala Asn 


Leu 


Leu 


Val 






710 






Glu 


Leu 


Asp Lys 


Arg 


Thr 


Glu 







235 




240 


Thr 


Leu 


Tyr 


Asp 


Val Lys Ser Gly 






250 




255 


Lys 


Leu 


Ser 


Leu 


Val Asp Leu Ala 






265 




270 


Thr 


Gly 


Ala 


Ala 


Gly Asp Arg Leu 






280 




285 


Lys 


Ser 


Leu 


Thr 


Thr Leu Gly Leu 






295 




300 


Gin 


Ser 


Ala 


Gly 


Lys Asn Lys Asn 






310 




315 


Ser 


Val 


Leu 


Thr 


Trp Leu Leu Lys 






325 




330 


Lys 


Thr 


Ala 


Met 


Val Ala Thr Val 




340 




345 


Asp 


Glu 


Thr 


Leu 


Ser Thr Leu Arg 




355 




360 


He 


Val 


Asn 


His 


Ala Val Val Asn 






370 




375 


He 


Arg 


Asp 


Leu 


Arg Glu Glu Val 






385 




390 


Thr 


Lys 


Ala 


Glu 


Ala Met Lys Ser 






400 




405 


Glu 


Glu 


Ser 


Glu 


Lys Leu He Gin 






415 




420 


Glu 


Lys 


Leu 


Arg 


Lys Thr Glu Glu 






430 




435 


Gin 


Leu 


Glu 


Ser 


Leu Gly He Ser 






445 




450 


Val 


Gly 


Asp 


Asp 


Lys Cys Phe Leu 






460 




465 


Ala 


Leu 


Asn 


Glu 


Leu Leu Val Tyr 






475 




480 


He 


Gly 


Ser 


Ala 


Asn Ser Gin Asp 




490 




495 


He 


Leu 


Pro 


Glu 


His Cys He He 






505 




510 


Val 


Met 


Leu 


Thr 


Pro Gin Lys Asn 






520 




525 


Ser 


Ser 


Val 


Ser 


Ser Pro He Gin 






535 




540 


Leu 


Trp 


Gly 


Asn 


Asn His Phe Phe 




550 




555 


Lys 


Lys 


Lys 


Ala 


Glu Arg Glu Asp 






565 




570 


Lys 


Asn 


Glu 


Asn 


Ser Ser Glu Gin 




580 




585 


Ser 


Ser 


Glu 


Val 


Ser Ser Glu Val 






595 




600 


Gin 


Met 


Glu 


Val 


Thr Met Lys Ala 






610 




615 


Gin 


Ser 


He 


Leu 


Asn Ser Leu Glu 






625 




630 


Arg 


Ser 


Ala 


Leu 


Glu Arg Gin Arg 






640 




645 


Glu 


Gin 


Leu 


Arg 


Arg Arg Leu Ser 






655 




660 


Ser 


Met 


Asp 


Arg 


Phe Ser Phe His 






670 


675 


Leu 


Arg 


Gin 


Trp 


Ala Glu Glu Arg 






685 




690 


Leu 


Met 


Arg 


Leu 


Arg Glu Gin He 






700 




705 


Arg 


Glu 


Ala 


Asn 


Tyr He Ala Glu 




715 




720 


Tyr 


Lys 


Val 


Thr 


Leu Gin He Pro 


730 




735 
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Ala Ser Ser Leu Asp Ala Asn Arg Lys Arg Gly Ser Leu Leu Ser 
740 745 750 

Glu Pro Ala lie Gin Val Arg 
755 
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<221> misc_feature 
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Pro Arg Gin His Phe Cys Met Gin Tyr Ser Leu Gin Val Val Thr 
15 10 15 

Ala Thr Cys Lys Phe Gly Met Asn Ala Leu Leu Leu Ser Ala Trp 
20 25 30 

Phe Gly His Leu Arg lie Leu Gin lie Leu Val Asn Ser Gly Ala 
35 40 45 

Lys lie His cys Glu Ser Lys Glu Gly Asn Thr Ala Leu His Leu 
50 55 60 

Ala Ala Gly Arg Gly His Met Ala Val Leu Gin Arg Leu Val Asp 
65 70 75 

lie Gly Leu Asp Leu Glu Glu Gin Asn Ala Glu Gly Leu Thr Ala ' 

80 85 90 

Leu His Ser Ala Ala Gly Gly Ser His Pro Asp Cys Val Gin Leu 
95 100 105 

Leu Leu Arg Ala Gly Ser Thr Val Asn Ala Leu Thr Gin Lys Asn 
110 115 120 

Leu Ser Cys Leu His Tyr Ala Ala Leu Ser Gly Ser Glu Asp Val 
125 130 135 

Ser Arg Val Leu He His Ala Gly Gly Cys Ala Asn Val Val Asp 
140 145 150 

His Gin Gly Ala Ser Pro Leu His Leu Ala Val Arg 
155 160 

<210> 368 
<2U> 635 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI .-290344 . 1 . orf 2 :2000MAY01 
<400> 368 

Ala Leu Val Phe Met Gin Pro Met Val Met Gin Gly Cys Pro Tyr 
15 10 15 

Thr Leu Pro Arg Cys His Asp Trp Gin Ala Ala Asp Gin Phe His 

20 25 30 

His Ser Ser Ser Leu Arg Ser Thr Cys Pro His Pro Gin Val Arg 

35 40 45 

Ala Ala Val Thr Ser Pro Ala Pro Pro Gin Asp Gly Ala Gly Val 

50 55 60 

Pro Cys Leu Ser Leu Lys Leu Leu Asn Gly Ser Val Gly Ala Ser 

65 70 75 

Gly Pro Leu Glu Pro Pro Ala Met Asn Leu Cys Trp Asn Glu He 

80 85 90 

Lys Lys Lys Ser His Asn Leu Arg Ala Arg Leu Glu Ala Phe Ser 

95 100 105 

Asp His Ser Gly Lys Leu Gin Leu Pro Leu Gin Glu He He Asp 
110 115 120 

Trp Leu Ser Gin Lys Asp Glu Glu Leu Ser Ala Gin Leu Pro Leu 
125 130 • 135 

Gin Gly Asp Val Ala Leu Val Gin Gin Glu Lys Glu Thr His Ala 
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140 



Ala 


Phe 


Met Glu 


Glu 


Val 


Lys 








155 






Val 


Leu 


Glu Ser 


Ala 


Gin 


Ala 








170 






Glu 


Leu 


Glu Glu 


Pro 


His 


Ser 








185 






Gin 


Arg 


He Gin 


Asn 


Leu 


Ser 








200 






Val 


Ala 


Ser Glu 


Leu 


Trp 


Glu 








215 




Gin 


His 


Arg His 


He 


Glu 


Arg 






230 






Gin 


Gly 


Ala Met 


Glu 


Glu 


Leu 








245 






Gly 


Val 


Arg Ala 


Thr 


Trp 


Glu 








260 






Ser 


Leu 


Pro Glu 


His 


He 


Gin 








275 






Phe 


Ser 


Pro Met 


Lys 


Asp 


Gly 








290 






His 


Gin 


Leu Ala 


He 


Ser 


Asp 








305 




Gin 


Ala 


Leu Glu 


Gin 


lie 


Asn 








320 






Ser 


Val 


Asp Glu 


Arg 


Leu 


Lys 








335 






Phe 


Gly 


Pro Gly 


Ser 


Gin 


His 








350 






Pro 


Trp 


Glu Arg 


Ala 


He 


Ser 








365 






Asn 


His 


Gin Ala 


Gin 


Thr 


Thr 








380 






Glu 


Leu 


Tyr Gin 


Thr 


Leu 


Ala 






395 






Ala 


Tyr 


Arg Thr 


Ala 


Met 


Lys 








410 






Arg 


Leu 


Asp Leu 


Val 


Thr 


Leu 








425 






Glu 


His 


Asp Leu 


Gin 


Ala 


Ser 






440 






Val 


lie 


His Cys 


Leu 


Thr 


Ala 






455 






Arg 


Gly 


He Leu 


Val 


Asn 


Val 






470 






Asn 


Trp 


Leu Leu 


Asn 


Val 


Phe 






485 






Arg 


Ala 


Leu Ser 


Phe 


Lys 


Thr 








500 




Glu 


Val 


Lys Glu 


Lys 


Leu 


Gin 






515 






Ser 


Gly 


Ser Gin 


Cys 


Asp 


Gin 








530 






Glu 


Ala 


He Gin 


Val 


Pro 


Arg 








545 






Gly 


Gly 


Ser Asn 


Val 


Glu 


Pro 




560 






Ser 


Thr 


Gly Lys 


Pro 


Val 


He 






575 






Val 


Asn 


Leu Glu 


Pro 


Gin 


Ser 








590 






Arg 


Val 


Thr He 


Ala 


Glu 


Gin 






605 






lie 


Cys 


Ser Ala 


Val 


Pro 


His 






620 






Glu 


Ala 


He Gin 


Arg 









145 




150 


Ser 


Arg Ala 


Pro Tyr 


He Tyr Ser 




160 




165 


Phe 


Leu Ser 


Gin His 


Pro Phe Glu 




175 




180 


Glu 


Ser Lys 


Asp Thr 


Ser Pro Lys 




190 




195 


Arg 


Phe Val 


Trp Lys 


Gin Ala Thr 




205 




210 


Lys 


Leu Thr 


Ala Arg 


Cys Val Asp 




220 




225 


Thr 


Leu Glu 


Gin Leu 


Leu Glu He 




235 




240 


Ser 


Thr Thr 


Leu Ser 


Gin Ala Glu 




250 




255 


Pro 


He Gly 


Asp Leu 


Phe He Asp 




265 




270 


Ala 


He Lys 


Leu Phe 


Lys Glu Glu 




280 




285 


Val 


Lys Leu 


Val Asn 


Asp Leu Ala 




295 




300 


Val 


His Leu 


Ser Met 


Glu Asn Ser 




310 




315 


Val 


Arg Trp 


Lys Gin 


Leu Gin Ala 




325 




330 


Gin 


Leu Gin 


Asp Ala 


His Arg Asp 




340 




345 


Phe 


Leu Ser 


Ser Ser 


Val Gin Val 




355 




360 


Pro 


Asn Lys 


Val Pro 


Tyr Tyr He 




370 




375 


Cys 


Trp Asp 


His Pro 


Lys Met Thr 




385 




390 


Asp 


Leu Asn 


Asn He 


Lys Phe Ser 




400 




405 


Leu 


Arg Arg 


Val Gin 


Lys Ala Leu 




415 




420 


Thr 


Thr Ala 


Leu Glu 


He Phe Asn 




430 




435 


Glu 


His Val 


Met Asp 


Val Val Glu 




445 


450 


Leu 


Tyr Glu 


Arg Leu 


Glu GlU Glu 




460 




465 


Pro 


Leu Cys 


Val Asp 


Met Ser Leu 




475 




480 


Asp 


Ser Gly 


Arg Ser 


Gly Lys Met 




490 




495 


Gly 


He Ala 


Cys Leu 


Cys Gly Thr 




505 




510 


Tyr 


Leu Phe 


Ser Gin 


Val Ala Asn 


520 




525 


Arg 


His Leu 


Gly Val 


Leu Leu His 


535 


540 


Gin 


Leu Gly 


Glu Val 


Ala Ala Phe 




550 




555 


Ser 


Val Arg 


Ser Cys 


Phe Arg Phe 




565 


570 


Glu 


Ala Ser 


Gin Phe 


Leu Glu Trp 




580 




585 


Met 


Val Trp 


Leu Pro 


Val Leu His 




595 




600 


Val 


Lys His 


Gin Thr 


Lys Cys Ser 




610 




615 


Gin 


Gly Val 


Gin Val 


Pro Glu Ser 




625 




630 
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Gin 


He 


Trp 


Met 


Lys 


Ser 


Thr 


Gin 


He 


He 


Gly 


Leu Gin Tyr He 


1 








5 










10 




15 


Ser 


Leu 


Pro 


Leu 


Gly 


Met 


Lys 


Glu 


He 


Gly 


Leu 


Lys Tyr Lys Arg 










20 










25 




30 


Asp 


Pro 


Arg 


Thr 


Asn 


Glu 


Gly 


He 


Leu 


Lys 


Val 


Val Lys Ala Leu 










35 










40 




45 


Asp 


Tyr 


Glu 


Gin 


Leu 


Gin 


Ser 


Val 


Lys 


Leu 


Ser 


He Ala Val Lys 






50 










55 




60 


Asn 


Lys 


Ala 


Glu 


Phe 


His 


Gin 


Ser 


Val 


He 


Ser 


Arg Tyr Arg Val 








65 










70 




75 


Gin 


Ser 


Thr 


Pro 


Val 


Thr 


He 


Gin 


Val 


He 


Asn 


Val Arg Glu Gly 










80 










85 




90 


lie 


Ala 


Phe 


Arg 


Pro 


Ala 


Ser 


Lys 


Thr 


Phe 


Thr 


Val Gin Lys Gly 










95 








100 




105 


lie 


Ser 


Ser 


Lys 


Lys 


Leu 


Val 


Asp 


Tyr 


He 


Leu 


Gly Thr Tyr Gin 










110 










115 




120 


Ala 


He 


Asp 


Glu 


Asp 


Thr 


Asn 


Lys 


Ala 


Ala 


Ser 


Asn Val Lys Tyr 








125 










130 




135 


Val 


Met 


Gly 


Arg 


Asn 


Asp 


Gly 


Gly 


Tyr 


Leu 


Met 


He Asp Ser Lys 










140 










145 




150 


Thr 


Ala 


Glu 


He 


Lys 


Phe 


Val 


Lys 


Asn 


Met 


Asn 


Arg Asp Ser Thr 










155 










160 




165 


Phe 


He 


Val 


Asn 


Lys 


Thr 


He 


Thr 


Ala 


Glu 


Val 


Leu Ala He Asp 










170 










175 




180 


Glu 


Tyr 


Thr 


Gly 


Lys 


Thr 


Ser 


Thr 


Gly 


Thr 


Val 


Tyr Val Arg Val 










185 










190 




195 


Pro 


Asp 


Phe 


Asn 


Asp 


Asn 


Cys 


Pro 


Thr 


Ala 


Val 


Leu Glu Lys Asp 










200 










205 




210 


Ala 


Val 


Cys 


Ser 


Ser 


Ser 


Pro 


Ser 


Val 


Val 


Val 


Ser Ala Arg Thr 










215 










220 




225 


Leu 


Asn 


Asn 


Arg 


Tyr 


Thr 


Gly 


Pro 


Tyr 


Thr 


Phe 


Ala Leu Glu Asp 










230 










235 




240 


Gin 


Pro 


Val 


Lys 


Leu 


Pro 


Ala 


Val 


Trp 


Ser 


He 


Thr Thr Leu Asn 








245 










250 




255 


Ala 


Thr 


Ser 


Ala 


Leu 


Leu 


Arg 


Ala 


Gin 


Glu 


Gin 


lie Pro Pro Gly 










260 








265 




270 


Val 


Tyr 


His 


He 


Ser 


Leu 


Val 


Leu 


Thr 


Asp 


Ser 


Gin Asn Asn Arg 








275 










280 




285 


Cys 


Glu 


Met 


Pro 


Arg 


Ser 


Leu 


Thr 


Leu 


Glu 


Val 


Cys Gin Cys Asp 










290 










295 




300 


Asn 


Arg 


Gly 


He 


Cys 


Gly 


Thr 


Ser 


Tyr 


Pro 


Thr 


Thr Ser Pro Gly 










305 










310 




315 


Thr 


Arg 


Tyr 


Gly 


Arg 


Pro 


His 


Ser 


Gly 


Arg 


Leu 


Gly Pro Ala Ala 










320 










325 




330 


lie 


Gly 


Leu 


Leu 


Leu 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu Leu Ala Pro 








335 








340 




345 


Leu 


Leu 


Leu 


Leu 


Thr 


Cys 


Asp 


Cys 


Gly 


Ala 


Gly 


Ser Thr Gly Gly 










350 










355 




360 


Val 


Thr 


Gly 


Gly 


Phe 


He 


Pro 


Val 


Pro 


Asp 


Gly 


Ser Glu Gly Thr 










365 










370 


375 


He 


His 


Gin 


Trp 


Gly 


He 


Glu 


Gly 


Ala 


His 


Pro 


Glu Asp Lys Glu 










380 








385 




390 


He 


Thr 


Asn 


He 


Cys 


Val 


Pro 


Pro 


Val 


Thr 


Ala 


Asn Gly Ala Asp 










395 










400 




405 


Phe 


Met 


Glu 


Ser 


Ser 


Glu 


Val 


Cys 


Thr 


Asn 


Thr 


Tyr Ala Arg Gly 










410 








415 




420 


Thr 


Ala 


Val 


Glu 


Gly 


Thr 


Ser 


Gly 


Asn 


Gly 


Asn 


Asp His 
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425 430 

<210> 370 
<211> 531 
<212> PRT 
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<220> 
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<221> unsure 

<222> 148, 291 

<223> unknown or other 

<400> 370 



Ala 


Cys 


His 


Leu 


Pro 


Pro 


Cys 


Leu 


Leu 


Leu 


Ala Ala Leu Asn 


Ala 


1 






5 








10 




15 


Trp 


Ser 


Phe 


Lys 


Leu 


Leu 


Leu 


Gly 


Leu 


Thr 


Lys Gin Gly Pro 


Cys 










20 










25 




30 


Leu 


Pro 


Leu 


Ala 


Thr 


Glu 


Glu 


Asp 


Ser 


Val 


Asn Thr Asn Pro 


Ser 










35 








40 




45 


Thr 


Glu 


Asp 


Glu 


Leu 


Leu 


Ala 


Ser 


Leu 


Ser 


Ala Glu Glu Leu 


Lys 








SO 










55 




60 


Glu 


Leu 


Glu 


Arg 


Glu 


Leu 


Glu 


Asp 


He 


Glu 


Pro Asp Arg Asn 


Leu 










65 










70 




75 


Pro 


Val 


Gly 


Leu 


Arg 


Gin 


Lys 


Ser 


Leu 


Thr 


Glu Lys Thr Pro 


Thr 










80 










85 




90 


Gly 


Thr 


Phe 


Ser 


Arg 


Glu 


Ala 


Leu 


Met 


Ala 


Tyr Trp Glu Lys 


Glu 








95 










100 




105 


Ser 


Gin 


Lys 


Leu 


Leu 


Glu 


Lys 


Glu 


Arg 


Leu 


Gly Glu Cys Gly 


Lys 










110 










115 




120 


Val 


Ala 


Glu 


Asp 


Lys 


GlU 


GlU 


Ser 


Glu 


Glu 


Glu Leu He Phe 


Thr 








125 










130 




135 


Glu 


Ser 


Asn 


Ser 


Glu 


Val 


Ser 


Glu 


GlU 


Val 


Tyr Thr Xaa Glu 


Glu 










140 










145 




150 


Glu 


Glu 


Glu 


Ser 


Gin 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu Asp Ser Asp 


Glu 










155 










160 




165 


Glu 


'Glu 


Arg 


Thr 


He 


Glu 


Thr 


Ala 


Lys 


Gly 


lie Asn Gly Thr 


Val 








170 








175 




180 


Asn 


Tyr 


Asp 


Ser 


Val 


Asn 


Ser 


Asp 


Asn 


Ser 


Lys Pro Lys He 


Phe 








185 










190 




195 


Lys 


Ser 


Gin 


He 


Glu 


Asn 


He 


Asn 


Leu 


Thr 


Asn Gly Ser Asn 


Gly 










200 










205 




210 


Arg 


Asn 


Thr 


Glu 


Ser 


Pro 


Ala 


Ala 


He 


His 


Pro Cys Gly Asn 


Pro 








215 










220 


225 


Thr 


Val 


He 


Glu 


Asp 


Ala 


Leu 


Asp 


Lys 


He 


Lys Ser Asn Asp 


Pro 










230 










235 




240 


Asp 


Thr 


Thr 


Glu 


Val 


Asn 


Leu 


Asn 


Asn 


He 


Glu Asn lie Thr 


Thr 








245 










250 




255 


Gin 


Thr 


Leu 


Thr 


Arg 


Phe 


Ala 


Glu 


Ala 


Leu 


Lys Asp Asn Thr 


Val 










260 










265 




270 


Val 


Lys 


Thr 


Phe 


Ser 


Leu 


Ala 


Asn 


Thr 


His 


Ala Asp Asp Ser 


Ala 










275 










280 




285 


Ala 


Met 


Ala 


He 


Ala 


Xaa 


Met 


Leu 


Lys 


Val 


Asn Glu His He 


Thr 










290 








295 




300 


Asn 


Val 


Asn 


Val 


Glu 


Ser 


Asn 


Phe 


lie 


Thr 


Gly Lys Gly lie 


Leu 










305 










310 


315 


Ala 


lie 


Met 


Arg 


Ala 


Leu 


Gin 


His 


Asn 


Thr 


Val Leu Thr Glu 


Leu 










320 










325 




330 


Arg 


Phe 


His 


Asn 


Gin 


Arg 


His 


He 


Met 


Gly 


Ser Gin Val Glu 


Met 










335 










340 




345 


Glu 


He 


Val 


Lys 


Leu 


Leu 


Lys 


Glu 


Asn 


Thr 


Thr Leu Leu Arg 


Leu 










350 










355 




360 


Gly 


Tyr 


His 


Phe 


Glu 


Leu 


Pro 


Gly 


Pro 


Arg 


Met Ser Met Thr 


Ser 



365 370 375 
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lie 


Leu 


Thr Arg 


Asn 
380 


Met 


Asp 


Lys 


Glu 


Gin 


Lys Gin 


Gin 
395 


Glu 


Gly 


Tyr 


Thr 


Lys 


Val Trp 


Gin 
410 


Arg 


Gly 


Thr 


Ser 


Pro 


Arg His 


Ser 


Pro 


Trp 


Ser 








425 






Val 


Gin 


Thr Val 


Arg 
440 


Ser 


Arg 


Pro 


Pro 


Pro 


Pro Arg 


Asp 
455 


Ser 


Ser 


Thr 


Gin 


Lys 


His Cys 


Arg 
470 


Ser 


His 


Gin 


Gly 


He 


Thr Lys 


Trp 
485 


Thr 


Lys 


Lys 


GlU 


Thr 


Ala Lys 


Gin 
500 


Tyr 


Ser 


Lys 


Val 


Ser 


Ala Arg 


Glu 
■515 


Glu 


Asn 


Gly 


Pro 


Thr 


Glu He 


Ser 
530 


Ser 







<210> 371 
<211> 257 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 2837 62.1 

<400> 371 



Lys 


Ala 


Phe 


Phe 


He 


Trp 


Trp 


Gin 


1 








5 








Gin 


Ser 


Phe 


Lys 


Phe 


Leu 


Glu 


Gin 








20 








He 


■Leu 


Lys 


Lys 


Thr 


His 


His 


Pro 








35 








Phe 


Val 


Ser 


Arg 


Phe 
50 


Ser 


Ala 


Met 


Pro 


Glu 


He 


Arg 


Thr 
65 


Glu 


He 


Arg 


Gin 


Gly 


Val 


Val 


Arg 
80 


Lys 


Thr 


Val 


He 


Trp 


Glu 


Pro 


Gin 


His 


Met 


Asp 








95 






Phe 


Asn 


Met 


Gin 


Lys 
110 


He 


Glu 


Glu 


Pro 


Ser 


Ser 


Pro 


Ser 


Ala 


Thr 


Asp 










125 






Leu 


Ala 


Glu 


Asn 


Cys 
140 


Phe 


Arg 


Glu 


Gly 


Asn 


Met 


Asn 


Asn 


Ala 


Val 


Arg 








155 






His 


His 


Lys 


Leu 


Trp 
170 


Asp 


Pro 


Asn 


Lys 


He 


He 


Met 


Tyr 


Ser 


He 


Gin 








185 








He 


Gin 


Glu 


He 


Leu 


Gly 


His 


Leu 










200 






Pro 


Arg 


Val 


Arg 


Ala 
215 


Gly 


He 


He 


Ala 


He 


Ala 


Ala 


Lys 


Gly 


Ser 


He 










230 






Phe 


Asn 


Thr 


Leu 


Leu 


Lys 


His 


Leu 



245 



Gin 


Arg 


Gin 


Lys Arg 


Leu Gin 




385 






390 


Asp 


Gly 


Gly 


Pro Asn 


Leu Arg 




400 






405 


Pro 


Ser 


Ser 


Ser Pro 


Tyr Val 




415 






420 


Ser 


Pro 


Lys 


Leu Pro 


Lys Lys 




430 






435 


Leu 


Ser 


Pro 


Val Ala 


Thr Pro 




445 






450 


Pro 


Arg 


Glu 


Lys Ala 


His Tyr 




460 






465 


Thr 


Thr 


Gly 


Glu Cys 


Pro Thr 




475 




480 


Glu 


Lys 


Arg 


Glu Lys 


Gly Gin 




490 






495 


Gly 


Asn 


Lys 


Lys Phe 


Ser Glu 




505 






510 


Arg 


Gin 


Phe 


Pro Thr 


Phe Tyr 




520 






525 



orf2:2000MAY01 



Ser 


Cys 


Trp Asn Arg 


Gly Asn 




10 




15 


He 


Leu 


Trp Ser Asn 


Leu Gin 




25 




30 


Thr 


His 


Arg Arg Tyr 


Asp Phe" 




40 




45 


Cys 


His 


Ser Cys His 


Ser Asp 




55 




60 


He 


Ala 


Gly He Arg 


Gly He 




70 




75 


Asn 


Asp 


Glu Leu Arg 


Ala Thr 




85 




90 


Lys 


He 


Val Pro Ser 


Leu Leu 


100 




105 


Val 


Asp 


Ser Arg He 


Gly Pro 




115 




120 


Lys 


Glu 


Glu Asn Pro 


Ala Val 




130 




135 


Leu 


Leu 


Gly Arg Ala 


Thr Phe 




145 




150 


Pro 


Val 


Phe Ala His 


Leu Asp 




160 




165 


Glu 


Phe 


Ala Val His 


Cys Phe 




175 




180 


Ala 


Gin 


Tyr Ser His 


His Val 




190 




195 


Asp 


Ala 


Arg Lys Lys 


Asp Ala 




205 




210 


Gin 


Val 


Leu Leu Glu 


Ala Val 




220 




225 


Gly 


Pro 


Thr Val Leu 


Glu Val 


235 




240 


Arg 


Leu 


Ser Val Glu 


Phe Glu 


250 




255 



200/228 



WO 01/62927 PCT/US01/06059 
Ser Lys 



<210> 372 
<211> 242 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :347687 . 113 .orfl :2 000MAY01 
<400> 372 



Gin 


Pro 


Cys Gly 


Phe 


Gin 


Gly 


Ala 


Glu 


Asn 


Arg 


Arg 


Lys Leu Ala 


1 






5 










10 






15 


Tyr 


Met 


Arg Thr 


Asp 


Trp 


Pro 


Glu 


Glu 


Gin 


Gin 


Leu 


Leu Val Ala 




20 








25 






30 


Leu 


Phe 


Cys Gly 


Cys 


Gly 


His 


Glu 


Ala 


Leu 


Thr 


Gly 


Thr Glu Lys 






35 










40 






45 


Leu 


He 


Glu Thr 


Tyr 


Phe 


Ser 


Lys 


Asn 


Tyr 


Gin 


Asp 


Tyr Glu Tyr 








50 








55 






60 


Leu 


He 


Asn Val 


He 


His 


Ala 


Phe 


Gin 


Tyr 


Val 


He 


Tyr Gly Thr 








65 










70 






75 


Ala 


Ser 


Phe Phe 


Phe 


Leu 


Tyr 


Gly 


Ala 


Leu 


Leu 


Leu 


Ala Glu Gly 








80 










85 






90 


Phe 


Tyr 


Thr Thr 


Gly 


Ala 


Val 


Arg 


Gin 


He 


Phe 


Gly 


Asp Tyr Lys 








95 










100 






105 


Thr 


Thr 


He Cys 


Gly 


Lys 


Gly 


Leu 


Ser 


Ala 


Thr 


Phe 


Val Gly He 








110 








115 






120 


Thr 


Tyr 


Ala Leu 


Thr 


Val 


Val 


Trp 


Leu 


Leu 


Val 


Phe 


Ala Cys Ser 








125 










130 






135 


Ala 


Val 


Pro Val 


Tyr 


He 


Tyr 


Phe 


Asn 


Thr 


Trp 


Thr 


Thr Cys Gin 








140 








145 






150 


Ser 


He 


Ala Phe 


Pro 


Ser 


Lys 


Thr 


Ser 


Ala 


Ser 


He 


Gly Ser Leu 








155 








160 






165 


cys 


Ala 


Asp Ala 


Arg 


Met 


Tyr 


Gly 


Val 


Leu 


Pro 


Trp 


Asn Ala Phe 








170 










175 






180 


Pro 


Gly 


Lys Val 


Cys 


Gly 


Ser 


Asn 


Leu 


Leu 


Ser* 


He 


cys Lys Thr 








185 










190 






195 


Ala 


Glu 


Phe Gin 


Met 


Thr 


Phe 


His 


Leu 


Phe 


He 


Ala 


Ala Phe Val 








200 










205 






210 


Gly 


Ala 


Ala Ala 


Thr 


Leu 


Val 


Ser 


Leu 


Leu 


Thr 


Phe 


Met He Ala 






215 










220 






225 


Ala 


Thr 


Tyr Asn 


Phe 


Ala 


Val 


Leu 


Lys 


Leu 


Met 


Gly 


Arg Gly Thr 








230 










235 






240 



Lys Phe 



<210> 373 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1146510 . 1 .orf2 :2000MAY01 
<400> 373 

Thr Glu Leu Gin Arg Pro Arg Ser Ala Ser He Tyr Ser Arg Tyr 
15 10 15 

Ala Ser Ser Asn Phe Arg Arg Cys Gly Val Glu Leu Lys Tyr Ser 

20 25 30 

Ser Lys Asn Arg Glu Met Thr Asp Thr Thr Asp Ala Val His Asn 

35 40 45 

Cys Asn Ala Arg Tyr Ser Ser Pro Arg Pro Lys Leu Leu Glu Asp 

50 55 60 



201/228 



WO 01/62927 



PCT/US01/06059 



<210> 374 
<211> 157 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 451710 . l.orf 1:200 0FEB18 
<400> 374 



Pro 


Gly 


Thr Pro 


Asn 


Pro 


Ala 


Pro Pro 


Pro 


Pro 


Arg Val His 


Pro 


1 






5 








10 






15 


Ser 


Ser 


Trp Arg 


Ala 


Pro 


He 


Gin Glu 


Met 


Ala 


Val Pro Leu 


Leu 






20 








25 






30 


Thr 


Lys 


Lys lie 


Val 


Lys 


Lys 


Arg Val 


Lys 


Gin 


Phe Lys Arg 


Pro 








35 








40 






45 


His 


Leu 


Asp Arg 


Tyr 


Lys 


Cys 


Leu Lys 


Pro 


Ser 


Trp Arg Arg 


Pro 








50 








55 






60 


Lys 


Gly 


lie Asp 


Ser 


Arg 


Val 


Arg Arg 


Lys 


Phe 


Lys Gly Cys 


Thr 








65 








70 






75 


Leu 


Met 


Pro Asn 


He 


Gly 


Tyr 


Gly Ser 


Asp 


Lys 


Ser Thr Arg 


His 








80 








85 






90 


Tyr 


Leu 


Pro Asn 


Lys 


Phe 


Lys 


Lys Phe 


Val 


Val 


His Asn Val 


Ser 








95 








100 






105 


GlU 


Leu 


Glu Leu 


Leu 


Met 


Met 


His Asn 


Arg 


Thr 


Tyr Cys Ala 


Glu 








110 








115 






120 


lie 


Ala 


His Asn 


Val 


Ser 


Thr 


Lys Lys 


Arg 


Lys 


Glu He Val 


Glu 








125 






130 






135 


Arg 


Ala 


Ala Gin 


Leu 


Asp 


He 


Val Val 


Thr 


Asn 


Lys Leu Ala 


Arg 








140 








145 






150 


Leu 


Arg 


Ser Gin 


Glu 


Asp 


Glu 


















155 















<210> 375 
<211> 158 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 455771 . 1 .orf 3 :2000FEB18 
<400> 375 



Ala 


Ser 


Cys Ser 


Arg 


Arg 


Arg 


Glu Ala 


Leu 


Gin Arg Thr Ser 


Val 


1 




5 








10 




15 


Asn 


Met 


Gly Lys 


Thr 


Arg 


Gly 


Met Gly 


Ala 


Gly Arg Lys Leu 


Lys 








20 








25 




30 


Thr 


His 


Arg Arg 


Asn 


Gin 


Arg 


Trp Ala 


Asp 


Lys Ala Tyr Lys 


Lys 








35 








40 




45 


Ser 


His 


Leu Gly 


Asn 


Glu 


Trp 


Lys Lys 


Pro 


Phe Ala Gly Ser 


Ser 








50 








55 




60 


His 


Ala 


Lys Gly 


He 


Val 


Leu 


Glu Lys 


He 


Gly He Glu Ala 


Lys 








. 65 








70 




75 


Gin 


Pro 


Asn Ser 


Ala 


He 


Arg 


Lys Cys 


Ala 


Arg Val Gin Leu 


Val 








80 








85 




90 


Lys 


Asn 


Gly Lys 


Lys 


He 


Ala 


Ala Phe 


Val 


Pro Asn Asp Gly 


Cys 








95 








100 




105 


Leu 


Asn 


Tyr He 


Glu 


Glu 


Asn 


Asp GlU 


Val 


Leu He Ala Gly 


Phe 






110 






115 


120 


Gly 


Arg 


Lys Gly 


His 


Ala 


Val 


Gly Asp 


He 


Pro Gly Val Arg 


Phe 








125 






130 




135 


Lys 


Val 


Val Lys 


Val 


Ser 


Gly 


Val Ser 


Leu 


Leu Ala Leu Phe 


Lys 






140 






145 




150 


Glu 


Lys 


Lys Glu 


Lys 


Pro 


Arg 


Ser 









155 
<210> 376 



202/228 



WO 01/62927 



PCT/US01/06059 



<211> 238 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:452089 . 1 .orf 2 :2000FEB18 
<400> 376 



Leu 


Gly 


Lys 


His Arg 


Arg Pro Pro Pro 


Pro 


Lys 


Asp 


Gly Arg Arg 


1 






5 




10 






15 


Arg 


Gly 


Tyr 


Gly Arg 


Leu Leu Ala Pro 


Met 


Ala 


His 


Glu Lys Lys 








20 




25 






30 


Leu 


Ser 


Asn 


Pro Met 


Arg Glu lie Lys 


Val 


Gin 


Lys 


Leu Val Leu 








35 


40 






45 


Asn 


lie 


Ser 


Val Gly 


Glu Ser Gly Asp 


Arg 


Leu 


Thr 


Arg Ala Ala 








50 




55 






60 


Lys 


Val 


Leu 


Glu Gin 


Leu Ser Gly Gin 


Thr 


Pro 


Val 


Phe Ser Lys 






65 


70 






75 


Ala 


Arg 


Tyr 


Thr Val 


Arg Ser Phe Gly 


He 


Arg 


Arg 


Asn Glu Lys 








80 




85 






90 


lie 


Ala 


Cys 


Tyr Val 


Thr Val Arg Gly 


GlU 


Lys 


Ala 


Met Gin Leu 








95 




100 






105 


Leu 


Glu 


Ser 


Gly Leu 


Lys Val Lys Glu 


Tyr 


Glu 


Leu 


Leu Arg Arg 








110 




115 






120 


Asn 


Phe 


Ser 


Asp Thr 


Gly Cys Phe Gly 


Phe 


Gly 


He 


Gin Glu His 








125 




130 






135 


lie 


Asp 


Leu 


Gly lie 


Lys Tyr Asp Pro 


Ser 


Thr 


Gly 


He Tyr Gly 








140 




145 






150 


Met 


Asp 


Phe 


Tyr Val 


Val Leu Glu Arg 


Ala 


Gly 


Tyr 


Arg Val Ala 








155 




160 






165 


Arg 


Arg 


Arg 


Arg Cys 


Lys Ser Arg Val 


Gly 


He 


Gin 


His Arg Val 








170 




175 






180 


Thr 


Lys 


Glu 


Asp Ser 


Met Lys Trp Phe 


Gin 


Val 


Lys 


Tyr Glu Gly 








185 




190 






195 


Val 


lie 


Leu 


Lys Gin 


Gly Ser Gly Leu 


His 


Val 


Pro 


Pro Leu Thr 








200 




205 






210 


Cys 


Gly 


Gin 


Asn Ser 


Ser Leu Val Ser 


Ser 


Pro 


Pro 


Pro Cys Gin 




215 




220 






225 


Arg 


Lys 


Thr 


Thr Thr 


His Leu Ala Arg 


Leu 


Phe 


Trp 


Val 






230 




235 









<210> 377 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 246415 . 1 .orf 3 :2000FEB18 
<400> 377 



Leu 


Pro 


Pro 


Val 


Arg 


Ala 


Ser Asn Met 


Met 


Lys 


Lys 


Arg Arg Asn 


1 








5 






10 






15 


Asn Gly 


Arg 


Thr 


Lys 


Lys 


Gly Arg Gly 


His 


Val 


Gin 


Pro He Cys 










20 






25 






30 


Asp 


Thr 


Asn 


Cys 


Ala 
35 


Gin 


Cys Val Pro 


Lys 
40 


Asp 


Lys 


Ala He Asn 
45 


Lys 


Phe 


He 


He 


Gly 


Asn 


Thr Val Glu 


Ala 


Ala 


Ala 


Val Arg Asp 








50 






55 






60 


He 


Ser 


Glu 


Ala 


Ser 


Val 


Phe Asp Ala 


Tyr 


Val 


Leu 


Pro Lys Leu 










65 




70 






75 


Tyr 


Leu 


Lys 


Leu 


His 
80 


Tyr 


Cys Leu Ser 


Cys 
85 


Ala 


He 


His Ser Arg 
90 


Val 


Val 


Arg 


Asn 


Arg 
95 


Ser 


Cys Glu Ala 


His 
100 


Lys 


Asp 





203/228 



WO 01/62927 



PCT/US01/06059 



<210> 378 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 414144 . 10 . orf 1 : 20 00FEB18 
<400> 378 



Val Tyr 


Trp Cys 


Arg 


Glu Leu 


He Leu Lys 


Arg 


Gly Gin 


Ala Lys 


1 




5 




10 






15 


Val Lys 


Asn Lys 


Thr 


He Pro 


Leu Thr Asp 


Asn 


Thr Val 


He Glu 






20 




25 






30 


Glu His 


Leu Gly 


Lys 


Phe Gly 


Val He Cys 


Leu 


Glu Asp 


Leu He 






35 




40 




45 


His Glu 


He Ala 


Phe 


Pro Gly 


Lys His Phe 


Gin 


Glu He 


Ser Trp 






50 




55 






60 


Phe Leu 


Cys Pro 


Phe 


His Leu 


Ser Val Ala 


Arg 


His Ala 


Thr Lys 






65 




70 






75 


Asn Arg 


Val Gly 


Phe 


Leu Lys 


Glu Met Gly 


Thr 


Pro Gly 


Tyr Arg 






80 




85 






90 


Gly Glu 


Arg He 


Asn 


Gin Leu 


He Arg Gin 


Leu 


Asn . 








95 




100 









<210> 379 
<211> 177 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1101445 . 1 .orf3 :2000FEB18 
<400> 379 



Gly Thr 


Met Glu 


Ala 


Val Pro Glu Lys 


Lys 


Lys 


Lys Val 


Ala Ala 


1 




5 




10 






15 


Ala Pro 


Gly Thr 


Leu 


Lys Lys Lys Lys 


Val 


Pro 


Ala Val 


Pro Glu 






20 




25 






30 


Thr Leu 


Lys Lys 


Lys 


Arg Arg Asn Phe 


Ala 


Glu 


Leu Lys 


Val Lys 






35 




40 






45 


Arg Leu 


Arg Lys 


Lys 


Phe Ala Leu Lys 


Thr 


Leu 


Arg Lys 


Ala Arg 






50 




55 






60 


Arg Lys 


Leu He 


Tyr 


Glu Lys Ala Lys 


His 


Tyr 


His Lys 


Glu Tyr 






65 




70 






75 


Arg Gin 


Met Tyr 


Arg 


Thr Glu He Arg 


Met 


Ala 


Arg Met 


Ala Arg 






80 




85 






90 


Lys Ala 


Gly Asn 


Phe 


Tyr Val Pro Ala 


Glu 


Pro 


Lys Leu 


Ala Phe 






95 




100 






105 


Val He 


Arg He 


Arg 


Gly He Asn Gly 


Val 


Ser 


Pro Lys 


Val Arg 






110 




115 






120 


Lys Val 


Leu Gin 


Leu 


Leu Arg Leu Arg 


Gin 


He 


Phe Asn 


Gly Thr 






125 


130 






135 


Phe Val 


Lys Leu 


Asn 


Lys Ala Ser Val 


Asn 


Met 


Leu Arg 


He Val 






140 




145 




150 


Glu Pro 


Tyr He 


Ala 


Trp Gly Val Pro 


Gin 


Pro 


Glu Val 


Ser Lys 






155 




160 






165 


Arg Ala 


His Leu 


Gin 


Thr Arg Leu Trp 


Gin 


Asn 


Gin 








170 


175 









<210> 380 

<211> 86 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 



204/228 



WO 01/62927 



PCT/US01/06059 



LG : 452134 . 1 . orf 2 : 2000FEB18 



Pro Glu lie Val Pro Ala Thr Leu Arg Arg 

10 15 
Gly Arg Pro Gin Pro Lys Gly Leu Lys Gly 

25 30 
Leu Thr Arg Gly Glu Ala Asp Arg Asp Thr 

40 45 
Val Pro Pro Asp Ala Asp Arg Lys Ala Glu 

55 60 
Gly Ser Glu Thr Glu Phe Gin Phe Arg Gly 

70 75 
Gly Gly Gin Pro Pro Gin 

85 



LI: 903021.1. orf 1:2000FEB01 



<223> Incyte ID No: 
<400> 380 

Leu Pro Ser Tyr Ala 
1 5 
Ser His Ser Glu Thr 
20 

Glu Arg Pro Ala Arg 
35 

Tyr Arg Gin lie Ala 
50 

Ala Glu Ala Gly Ala 
65 

Arg Phe Gly cys Gly 
80 

<210> 381 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 381 

Arg Asn Phe Thr Glu 
1 5 
Ala Gin Lys Met Leu 
20 

Lys Ala Lys His Tyr 
35 

Glu lie Gin lie Ser 
50 

Val Ser Ala Glu Pro 
65 

Tyr Gin Leu Gly Glu 
80 

Leu Pro Ser Ser Asn 
95 



Leu 


Lys 


lie Lys 


Arg 








10 


Leu 


Lys 


Ala Arg 


Arg 








25 


His 


Lys 


Glu Tyr 


Met 








40 


Arg 


lie 


Ala Arg 


Lys 








55 


Lys 


Leu 


Ala Phe 


Val 








70 


Pro 


Lys 


Gly Leu 


Lys 








85 


Leu 


Gin 







Leu 


Arg Asn Lys 


Phe 






15 


Lys 


Leu lie Tyr 


Glu 






30 


Gin 


Met Tyr Arg 


Thr 






45 


Ala 


Gly Asn Phe 


Tyr 






60 


lie 


Arg lie Gly 


Gly 






75 


Gly 


Val Ala Thr 


Ser 






90 



<210> 382 

<211> 82 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID NO: LI:246422 . 1 .orf 1 :2000FEB01 
<400> 382 



Lys Lys Arg Lys 


Gin 


Val Pro 


Lys Phe Thr 


Leu Asp Arg Thr 


His 


1 


5 




10 




15 


Pro Val Glu Asp 


Gly 


He Met 


Asp Ala Ala 


Asn Phe Glu Gin 


Phe 




20 




25 




30 


Phe Gin Glu Arg 


He 


Lys Met 


Asn Gly Lys 


Ala Gly Asn Phe 


Gly 




35 




40 




45 


Gly Gly Val Val 


Thr 


His Arg 


Arg Glu Gin 


Glu Gin Asp Gin 


Arg 




50 




55 




60 


Asp lie Gin Ala 


Ala 


Leu Phe 


Gin Gin Val 


Phe Glu He Ser 


His 




65 




70 




75 


Gin Lys lie Ser 


Glu 


Glu Glu 










80 











<210> 383 
<211> 180 
<212> PRT 



205/228 



WO 01/62927 



PCT/US01/06059 



<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 449404. 1 .orf 1 :2000MAY19 
<400> 383 



Ser 


Gin 


Pro Lys 


Ser 


Cys 


Leu 


Arg Ser 


Gly 


His 


Pro Ser 


Leu His 


1 




5 




10 






15 


Ala 


Thr 


Met Ser 


Arg 


Arg 


Lys 


Thr Arg 


Glu 


Pro 


Lys Glu 


Glu Asn 








20 








25 






30 


Val 


Thr 


Leu Gly 


Pro 


Thr 


Val 


Arg Glu 


Gly 


Glu 


Tyr Val 


Phe Gly 








35 








40 






45 


Val 


Ala 


His He 


Phe 


Ala 


Ser 


Phe Asn 


Asp 


Thr 


Phe He 


His He 








50 








55 






60 


Thr 


Asp 


Leu Ser 


Gly 


Arg 


Glu 


Thr Leu 


Val 


Arg 


He Thr 


Gly Gly 








65 








70 






75 


Met 


Lys 


Val Lys 


Ala 


Asp 


Arg 


Asp Glu 


Ser 


Ser 


Pro Tyr 


Ala Ala 








80 








85 






90 


Met 


Leu 


Ala Ala 


Gin 


Asp 


Val 


Ala Gin 


Arg 


Cys 


Lys Glu 


Leu Gly 








95 








100 






105 


lie 


Thr 


Ala Leu 


His 


He 


Lys 


Leu Arg 


Ala 


Thr 


Gly Gly 


Asn Lys 








110 






115 






120 


Thr 


Lys 


Thr Pro 


Gly 


Pro 


Gly 


Ala Gin 


Ser 


Ala 


Leu Arg 


Ala Leu 






12b 






130 






135 


Ala 


Arg 


Ser Gly 


Met 


Lys 


He 


Gly Arg 


He 


Glu 


Asp Val 


Thr Pro 








140 








145 






150 


Val 


Pro 


Thr Asp 


Ser 


Thr 


Arg 


Arg Lys 


Gly 


Gly 


Arg Arg 


Gly Lys 








155 








160 






165 


Glu 


Asp 


Cys Arg 


Arg 


His 


His 


Tyr Cys 


Val 


Pro 


Phe Ala 


Gly Ser 








170 








175 






180 



<210> 384 
<211> 118 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 449413 . 1. orf 3 :2000MAY19 
<400> 384 



Pro 


Arg 


Cys 


Phe Arg 


Leu Pro Gin Arg 


Arg 


Arg Pro Ser 


Gin Pro 


1 




5 




10 


Arg Gly Met 


15 


Val 


Pro 


Ser 


Ser Ala 


Thr Met Gly Lys 


Thr 


Gly Ala 








20 


25 




30 


Gly 


Arg 


Lys 


Leu Lys 


Thr His Arg Arg 


Asn 


Gin Arg Trp 


Ala Asp 








35 




40 




45 


Lys 


Ala 


Tyr 


Lys Lys 


Ser His Leu Gly 


Asn 


Glu Trp Lys 


Lys Pro 








50 




55 




60 


Phe 


Ala 


Gly 


Ser Ser 


His Ala Lys Gly 


He 


Val Leu Glu 


Lys He 






65 


70 




75 


Gly 


He 


Glu 


Ala Lys 


Gin Pro Asn Ser 


Ala 


He Arg Lys 


Cys Ala 








80 




85 




90 


Arg 


Val 


Gin 


Leu Val 


Lys Asn Gly Lys 


Lys 


He Ala Ala 


Phe Val 








95 




100 




105 


Pro 


Asn 


Asp 


Gly Cys 


Leu Asn Tyr He 


Glu 


Glu Asn Val 










110 


115 







<210> 385 
<211> 164 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID 


No: 


LG: 450105 . 1 .orf 2 : 2000MAY19 




<400> 385 














Pro 


Ser 


Ala Ala 


Arg 


Pro 


Pro Tyr Ser Arg 


Tyr 


Arg Ala Arg 


Arg 


1 






5 




10 






15 


His 


His 


Leu Arg 


Arg 


Ser 


Asn Met Gly Lys 


Thr 


Arg Gly Met 


Gly 








20 




25 






30 


Ala 


Gly 


Arg Lys 


Leu 


Lys 


Thr His Arg Arg 


Asn 


Gin Arg Trp 


Ala 








35 




40 






45 


Asp 


Lys 


Ala Tyr 


Lys 


Lys 


Ser His Leu Gly 


Asn 


Glu Trp Lys 


Lys 








50 




55 






60 


Pro 


Phe 


Ala Gly 


Ser 


Ser 


His Ala Lys Gly 


He 


Val Leu Glu 


Lys 






65 




70 






75 


lie 


Gly 


He Glu 


Ala 


Lys 


Gin Pro Asn Ser 


Ala 


He Arg Lys 


Cys 








80 




85 






90 


Ala 


Arg 


Val Gin 


Leu 


Val 


Lys Asn Gly Lys 


Lys 


He Ala Ala 


Phe 








95 




100 






105 


Val 


Pro 


Asn Asp 


Gly 


Cys 


Leu Asn Tyr He 


Glu 


Glu Asn Asp 


Glu 








110 




115 






120 


Val 


Leu 


He Ala 


Gly 


Phe 


Gly Arg Lys Gly 


His 


Ala Val Gly 


Asp 








125 




130 






135 


He 


Pro 


Gly Val 


Arg 


Phe 


Lys Val Val Lys 


Val 


Ser Gly Val 


Ser 








140 




145 






150 


Leu 


Leu 


Ala Leu 


Phe 


Lys 


Glu Lys Lys Glu 


Lys 


Pro Arg Ser 










155 


160 









<210> 386 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:460809 . l.orf 3 :2000MAY19 
<400> 386 

Ala Trp Val Glu Trp Ala Ser Arg Ser Ala Pro Arg Ala His Arg 
15 10 15 

Glu He Gin Lys Phe Ala Met Lys Glu Met Gly Thr Pro Asn Leu 

20 25 30 

His He Asp Val Arg Leu Asn Lys Ala Leu Trp Ala Lys Gly lie 

35 40 45 

Arg Asn Val Pro Tyr His He His Met Lys Leu Pro Arg Lys Leu 

50 55 60 

Asn Glu Asp Glu Asp Ser Pro Asp Lys Leu Tyr Ala Leu Val Pro 

65 70 75 

Thr Tyr Thr Cys Tyr His Phe His Lys Ser He Asp Arg Gin Cys 

80 85 90 

Gly Arg Glu Leu Thr Thr Asp Gly Ser He His 

95 100 

<210> 387 
<211> 259 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:481781 . 1 . orf3 :2000MAY19 
<400> 387 

Arg Ser Val Arg Arg Arg Ser Ser Ser Ser Arg Arg Arg Arg Val 
15 10 15 

Ala Ala Pro His Leu Glu Leu Ala Thr Met Ala Arg Gly Leu Lys 

20 25 30 

Lys His Leu Lys Arg Leu Asn Ala Pro Lys His Trp Met Leu Asp 

35 40 45 
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Lys 


Leu 


Gly 


Gly 


Ala 


Phe 


Ala 


Pro 


Lys 


Pro 


Ser 


Ser Gly Pro 


His 








50 










55 






60 


Lys 


Ser 


Arg 


Glu 


cys 


Leu 


Pro 


Leu 


He 


Leu 


He 


He Arg Asn 


Arg 






65 










70 






75 


Leu 


Lys 


Tyr 


Ala 


Leu 


Thr 


Tyr 


Arg 


Glu 


Val 


He 


Ser He Leu 


Met 








80 










85 






90 


Gin 


Arg 


His 


Val 


Leu 


Val 


Asp 


Gly 


Lys 


Val 


Arg 


Thr Asp Lys 


Thr 










95 










100 






105 


Tyr 


Pro 


Ala 


Gly 


Phe 


Met 


Asp 


Val 


He 


Ser 


He 


Pro Lys Thr 


Asn 








110 










115 






120 


Glu 


Asn 


Tyr 


Arg 


Leu 


Leu 


Tyr 


Asp 


Thr 


Lys 


Gly 


Arg Phe Arg 


Leu 








125 










130 






135 


His 


Pro 


He 


Arg 


Asp 


Glu 


Asp 


Ala 


Lys 


Phe 


Lys 


Leu Cys Lys 


Val 










140 










145 






150 


Arg 


Ser 


Val 


Gin 


Phe 


Gly 


Gin 


Lys 


Gly 


lie 


Pro 


Tyr Leu Asn 


Thr 








155 










160 






165 


Tyr 


Asp 


Gly 


Arg 


Thr 


He 


Arg 


Tyr 


Pro 


Asp 


Pro 


Leu He Lys 


Ala 










170 










175 






180 


Asn 


Asp 


Thr 


He 


Lys 


He 


Asp 


Leu 


Glu 


Thr 


Asn 


Lys He Val 


Asp 










185 










190 






195 


Phe 


He 


Lys 


Phe 


Asp 


Val 


Gly 


Asn 


Val 


Val 


Met 


Val Thr Gly 


Gly 










200 










205 






210 


Arg 


Asn 


Thr 


Gly 


Arg 


Val 


Gly 


Val 


He 


Lys 


Asn 


Arg Glu Lys 


His 










215 










220 






225 


Lys 


Gly 


Ser 


Phe 


Glu 


Thr 


He 


His 


Val 


Glu 


Asp 


Ser Trp Ala 


Thr 






230 










235 






240 


Gly 


Ser 


Pro 


Pro 


Val 


Trp 


Ala 


Thr 


Cys 


Ser 


Pro 


Ser Ala Arg 


Val 










245 






250 






255 


He 


Ser 


Arg 


Gly 





















<210> 388 
<211> 184 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:1101153 .l.orf2 :2000MAY19 
<400> 388 



Arg 


Arg 


Arg 


Pro 


Phe 


Leu 


Leu 


Arg 


Ser 


Phe 


Ala Ala Ala Met 


Val 


1 




5 










10 




15 


Lys 


Tyr 


Ser 


Gin 


Glu 


Pro 


Gly 


Asn 


Pro 


Thr 


Lys Ser Ala Lys 


Ala 






20 










25 




30 


Met 


Gly 


Arg 


Asp 


Leu 
35 


Arg 


Val 


His 


Phe 


Lys 
40 


Asn Thr Arg Glu 


Thr 
45 


Ala 


Phe 


Ala 


Leu 


Arg 


Lys 


Leu 


Pro 


Leu 


Thr 


Lys Ala Lys Arg 


Tyr 










50 








55 


60 


Leu 


Glu 


Asp 


Val 


He 


Ala 


His 


Lys 


Gin 


Ala 


He Pro Phe Arg 


Arg 








- 65 










70 




75 


Tyr 


Cys 


Gly 


Gly 


Val 
80 


Gly 


Arg 


Thr 


Ala 


Gin 
85 


Ala Lys Ser Arg 


His 
90 


Ser 


Asn 


Gly 


Gin 


Gly 
95 


Arg 


Trp 


Pro 


Val 


Lys 
100 


Ser Ala Arg Phe 


He 
105 


Lea 


Asp 


Leu 


Leu 


Lys 
110 


Asn 


Ala 


Glu 


Ser 


Asn 
115 


Ala Asp Val Lys 


Gly 
120 


Leu 


Asp 


Val 


Asp 


Asn 


Leu 


Tyr 


Val 


Ser 


His 


He Gin Val Asn 


Gin 








125 








130 




135 


Ala 


Gin 


Lys 


Gin 


Arg 
140 


Arg 


Arg 


Thr 


Tyr 


Arg 
145 


Ala His Gly Arg 


He 
150 


Asn 


Pro 


Tyr 


Met 


Ser 


Ser 


Pro 


Cys 


His 


He 


Glu Leu He Leu 


Ser 










155 








160 




165 


Glu 


Lys 


Glu 


Glu 


Pro 


Val 


Lys 


Lys 


Glu 


Ala 


Asp Asn He Val 


Ala 










170 






175 




180 


Ala 


Arg 


Lys 


Gin 
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<210> 389 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 257695 . 20 .orf2 :2000MAY01 
<400> 389 



Ala 


Pro 


Arg 


Gly Glu 


Gly 


Cys 


Leu Val 


His 


Ala 


Ser Glu Pro 


Cys 


1 






5 








10 






15 


Arg 


Pro 


Arg 


Ala Arg 


Cys 


Ser 


Leu Cys 


Arg 


Ser 


Ser Asp Ala 


Arg 








20 








25 






30 


Arg 


Gin 


Arg 


Gin Leu 


Trp 


Ala 


His Cys 


Lys 


Arg 


Gly Asn Gly 


Leu 








35 








40 






45 


He 


Lys 


Val 


Asn Gly 


Arg 


Pro 


Leu Glu 


Met 


He 


Glu Pro Arg 


Thr 








50 








55 






60 


Leu 


Gin 


Tyr 


Lys Leu 


Leu 


Glu 


Pro Val 


Leu 


Leu 


Leu Gly Lys 


Glu 








65 








70 




75 


Arg 


Phe 


Ala 


Gly Val 


Asp 


He 


Arg Val 


Arg 


Val 


Lys Gly Gly 


Gly 








80 








85 






90 


His 


Val 


Pro 


Gin He 


Tyr 


Gly 


Glu Ser 


Gin 


Glu 


Leu Gly Ala 


Trp 








95 








100 






105 


Arg 


Arg" 


Tip 


Leu rrp 


Glu 


Gly 


Gly Leu 


His 


Ser 


Ala Pro Val 


Pro 








110 








115 






120 


Phe 


Asn 


Cys 


Val Ser 


Phe 


Ser 


Gin Leu 


Ser 


Val 


Ser Pro Ser 


Pro 








125 








130 






135 


Lys 


Pro 


Trp 


Trp Pro 


He 


Thr 


Arg Asn 


Val 


Ser 


Glu His Gly 


Ser 



140 145 150 

Phe Pro 



<210> 390 
<211> 158 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 455771 . l.orf 3 : 2000MAY01 
<400> 390 



Ala 


Ser 


Cys Ser 


Arg 


Arg Arg Glu Ala 


Leu 


Gin Arg Thr Ser 


Val 


1 






5 




10 




15 


Asn 


Met 


Gly Lys 


Thr 


Arg Gly Met Gly 


Ala 


Gly Arg Lys Leu 


Lys 








20 




25 




30 


Thr 


His 


Arg Arg 


Asn 


Gin Arg Trp Ala 


Asp 


Lys Ala Tyr Lys 


Lys 








35 




40 




45 


Ser 


His 


Leu Gly 


Asn 


Glu Trp Lys Lys 


Pro 


Phe Ala Gly Ser 


Ser 








50 




55 




60 


His 


Ala 


Lys Gly 


lie 


Val Leu Glu Lys 


lie 


Gly He Glu Ala 


Lys 








65 


70 




75 


Gin 


Pro 


Asn Ser 


Ala 


He Arg Lys Cys 


Ala 


Arg Val Gin Leu 


Val 








80 




85 




90 


Lys 


Asn 


Gly Lys 


Lys 


He Ala Ala Phe 


Val 


Pro Asn Asp Gly 


Cys 








95 




100 




105 


Leu 


Asn 


Tyr He 


Glu 


Glu Asn Asp Glu 


Val 


Leu He Ala Gly 


Phe 








110 


115 


120 


Gly 


Arg 


Lys Gly 


His 


Ala Val Gly Asp 


He 


Pro Gly Val Arg 


Phe 








125 




130 




135 


Lys 


Val 


Val Lys 


Val 


Ser Gly Val Ser 


Leu 


Leu Ala Leu Phe 


Lys 








140 


145 




150 


Glu 


Lys 


Lys Glu 


Lys 


Pro Arg Ser 














155 











<210> 391 
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<2U> 94 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 274551 . l.orf 1:2000MAY01 
<400> 391 



Pro Phe 


Thr 


Val 


Thr 


Gin 


Leu Gin Pro 


Thr 


Thr 


Leu Gin Ser Phe 


1 






5 






10 




15 


Pro Cys 


Leu 


Ser 


Val 


Leu 


Gin Afg Leu 


Ser 


His 


Val Ser Gly Phe 








20 






25 




30 


Leu Arg 


Ser 


Ser 


Thr 


Leu 


He Gly Leu 


He 


Trp 


Cys Ser Ala Gin 








35 






40 




45 


Arg Ala 


Thr 


Pro 


Ser 


Leu 


Thr Tyr He 


Gly 


Ser 


Ser His Leu Asp 








50 




55 




60 


Ala Ser 


Thr 


Gin 


Arg 


Trp 


Ala Trp Pro 


Leu 


Ser 


Lys Ala Leu Ala 








65 






70 




75 


Ala Leu 


Gin 


Val 


Pro 


Pro 


Ala Arg Pro 


Ser 


Trp 


Leu Arg Ala Val 








80 






85 




90 


Phe Ser 


Leu 


Leu 















<210> 332 

<211> 83 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:035973 . 1 .orf 3 :2000MAY01 
<400> 392 



Gly Cys 


Leu 


Ala Gly 


He 


Arg Lys Asp Asn 


Lys Met Lys Gly Thr 


1 




5 




10 




15 


Ser Pro 


Phe 


Gly Lys 


cys 


Arg Asp Met He 


His 


Lys Leu Cys Cys 






20 




25 




30 


Leu Cys 


Gly 


Ser Lys 


Ala 


Tyr His Leu Gin 


Lys 


Ser Thr Cys Gly 






35 




40 




45 


Lys Cys 


Gly 


Ser Pro 


Ala 


Lys Arg Lys Arg 


Lys 


Cys Asn Trp Thr 






50 




55 




60 


Ala Thr 


Ala 


Lys Arg 


Lys 


Tyr His Gly Asp 


Trp 


Leu Asn Glu Ala 






65 




70 




75 


Pro Lys 


His 


Cys He 


Leu 


Gin He 










80 











<210> 393 
<211> 174 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 978427 . 5. orf 2 :2000FEB18 
<220> 

<221> unsure 
<222> 151 

<223> unknown or other 
<400> 393 

Trp Trp Val Cys Asp Gly Cys Leu Cys Phe Arg Thr Thr Pro Ala 
15 10 15 

Val Leu Phe Trp Gin Trp He Asn Gin Ser Phe Asn Ala Val Val 
20 25 30 

Asn Tyr Thr Asn Arg Ser Gly Asp Ala Pro Leu Thr Val Asn Glu 
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35 40 45 ' 



Leu 


Gly 


Thr Ala 


Tyr 


Val 


Ser 


Ala 


Thr 


Thr 


Gly 


Ala Val Ala 


Thr 








50 










55 






60 


Ala 


Leu 


Gly Leu 


Asn 


Ala 


Leu 


Thr 


Lys 


His 


Val 


Ser Pro Leu 


He 








65 








70 






75 


Gly 


Arg 


Phe Val 


Pro 


Phe 


Ala 


Ala 


Val 


Ala 


Ala 


Ala Asn Cys 


He 








80 










85 




90 


Asn 


He 


Pro Leu 


Met 


Arg 


Gin 


Arg 


Glu 


Leu 


Lys 


Val Gly He 


Pro 








95 










100 




105 


Val 


Thr 


Asp Glu 


Asn 


Gly 


Asn 


Arg 


Leu 


Gly 


Glu 


Ser Ala Asn 


Ala 








110 










115 






120 


Ala 


Lys 


Gin Ala 


He 


Thr 


Gin 


Val 


Val 


Val 


Ser 


Arg He Leu 


Met 








125 










130 






135 


Ala 


Ala 


Pro Gly 


Met 


Gly 


He 


Pro 


Pro 


Phe 


He 


Met Asn Thr 


Leu 








140 








145 






150 


Xaa 


Lys 


Lys Ala 


Phe 


Leu 


Lys 


Arg 


Phe 


Pro 


Met 


Asp Glu Cys 


Thr 








155 










160 






165 


His 


Ser 


Ser Trp 


Val 


Ser 


Trp 


lie 


Leu 











170 

<210> 394 
<211> 183 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 247781. 2 .orf 3 :2000FEB18 
<400> 394 



Gin 


Gly 


Pro 


Arg 


Val 


Leu 


Leu 


Ala 


Met 


Pro 


Tyr 


Leu Pro Asn 


Ser 


1 








5 










10 






15 


Ala 


Gly 


Tyr 


His 


His 
20 


Leu 


Cys 


Gly 


His 


Arg 
25 


Pro 


Gly Arg Leu 


Arg 
30 


Asp 


Ser 


Glu 


Glu 


Leu 
35 


Val 


Ala 


Ser 


Ala 


Val 
40 


Gin 


Pro Arg Leu 


Gly 
45 


Arg 


Pro 


Arg 


His 


Pro 
50 


Arg 


Ala 


Pro 


Gly 


Leu 
55 


Arg 


Tyr His He 


Gin 
60 


His 


Leu 


Arg 


Pro 


Asp 
65 


Ser 


Gin 


Leu 


Pro 


Ala 
70 


Gly 


Pro Gly Pro 


Asp 
75 


Pro 


His 


Ala 


Gly 


Thr 
80 


Ser 


Leu 


His 


Arg 


Gly 
85 


Trp 


Pro Pro Ala 


Val 
90 


His 


Ala 


Gly 


Ser 


Ala 
95 


Thr 


Ser 


His 


Pro 


Val 
100 


Pro 


Gly Gly His 


Ala 
105 


Gly 


Pro 


Leu 


Pro 


Gly 
110 


Asp 


Arg 


Pro 


Gin 


Leu 
115 


His 


Glu Gly Tyr 


Ser 
120 


Ser 


Cys 


Glu 


His 


Leu 
125 


Leu 


Cys 


Gly 


Leu 


Arg 

130 


Glu 


His Glu Ala 


Gly 
135 


Leu 


Gly 


Gly 


His 


Val 
140 


Gin 


Val 


Arg 


Asp 


Pro 
145 


Glu 


Pro Val Pro 


Pro 
150 


He 


Pro 


His 


Pro 


Pro 
155 


His 


Leu 


Ser 


His 


Trp 
160 


Arg 


Leu Met He 


Gin 
165 


Pro 


Gin 


Asp 


Pro 


Tyr 
170 


Ser 


Leu 


Ala 


Thr 


Arg 
175 


Ser 


Gin Tyr Pro 


Asp 
180 



Pro Gly Ser 



<210> 395 
<211> 399 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> rais cofeature 

<223> Incyte ID No: LI : 034583 . 1 .orf 1 :2000FEB01 
<400> 395 
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Val 


Gin 


His Ala 


Thr 


Val 


He 


Pro 


Glu 


Thr 


Met 


Ala Gly Thr 


Gin 


1 








5 










10 






15 


Gin 


Leu 


Ala Asp 


Trp 


Arg 


Asn 


Thr 


His 


Ala 


His 


Gly Ser His 


Tyr 










20 










25 






30 


Asn 


Pro 


lie Met 


Gin 


Gin 


Pro 


Ala 


Leu 


Leu 


Thr 


Gly His Val 


Thr 










35 










40 






45 


Leu 


Pro 


Ala Ala 


Gin 


Pro 


Leu 


Asn 


Val 


Gly 


Val 


Ala His Val 


Met 










50 










55 






60 


Arg 


Gin 


Gin 


Pro 


Thr 


Ser 


Thr 


Thr 


Ser 


ser 


Arg 


Lys Ser Lys 


Gin 










65 










70 






75 


His 


Gin 


Ser 


Ser 


Val 


Arg 


Asn 


Val 


Ser 


Thr 


cys 


Glu Val Ser 


Ser 










80 










85 






90 


Ser 


Gin 


Ala 


lie 


Ser 


Ser 


Pro 


Gin 


Arg 


Ser 


Lys 


Arg Val Lys 


Glu 










95 










100 






105 


Asn 


Thr 


Pro 


Pro 


Arg 


Cys 


Ala 


Met 


Val 


His 


Ser 


Ser Pro Ala 


Cys 










110 










115 






120 


Ser 


Thr 


Ser 


Val 


Thr 


Cys 


Gly 


Trp 


Gly 


Asp 


Val 


Ala Ser Ser 


Thr 










125 










130 






135 


Thr 


Arg 


Glu Arg 


Gin 


Arg 


Gin 


Thr 


He 


Val 


He 


Pro Asp Thr 


Pro 










140 










145 






150 


Ser 


Pro 


Thr 


Val 


Ser 


Val 


He 


Thr 


He 


Ser 


Ser 


Asp Thr Asp 


Glu 










155 










160 






165 


Glu 


Glu 


Glu 


Gin 


Lys 


His 


Ala 


Pro 


Thr 


Ser 


Thr 


Val Ser Lys 


Gin 










170 










175 






180 


Arg 


Lys 


Asn 


Vai 


lie 


ser 


Cys 


Val 


Thr 


Val 


His 


Asp Ser Pro 


Tyr 










185 










190 






195 


Ser 


Asp 


Ser 


Ser 


Ser 


Asn 


Thr 


Ser 


Pro 


Tyr 


Ser 


Val Gin Gin 


Arg 










200 










205 






210 


Ala 


Gly 


His 


Asn 


Asn 


Ala 


Asn 


Ala 


Phe 


Asp 


Thr 


Lys Gly Ser 


Leu 










215 










220 






225 


Glu 


Asn 


His 


Cys 


Thr 


Gly 


Asn 


Pro 


Arg 


Thr 


He 


He Val Pro 


Pro 










230 










235 






240 


Leu 


Lys 


Thr 


Gin 


Ala 


Ser 


Glu 


Val 


Leu 


Val 


Glu 


Cys Asp Ser 


Leu 










245 










250 






255 


Val 


Pro 


Val 


Asn 


Thr 


Ser 


His 


His 


Ser 


Ser 


Ser 


Tyr Lys Ser 


Lys 










260 










265 




270 


Ser 


Ser 


Ser 


Asn 


Val 


Thr 


Ser 


Thr 


Ser 


Gly 


His 


Ser Ser Gly 


Ser 










275 










280 




285 


Ser 


Ser 


Gly Ala 


He 


Thr 


Tyr 


Arg 


Gin 


Gin 


Arg 


Pro Gly Pro 


His 










290 










295 






300 


Phe 


Gin 


Gin 


Gin 


Gin 


Pro 


Leu 


Asn 


Leu 


Ser 


Gin 


Ala Gin Gin 


His 










305 










310 






315 


lie 


Thr 


Thr Asp 


Arg 


Thr 


Gly 


Ser 


His 


Arg 


Arg 


Gin Gin Ala 


Tyr 










320 










325 






330 


lie 


Thr 


Pro 


Thr 


Met 


Ala 


Gin 


Ala 


Pro 


Tyr 


Ser 


Phe Pro His 


Asn 










335 










340 






345 


Ser 


Pro 


Ser 


His 


Gly 


Thr 


Val 


His 


Pro 


His 


Leu 


Ala Ala Ala 


Ala 










350 










355 






360 


Ala 


Ala 


Ala 


His 


Leu 


Pro 


Thr 


Gin 


Pro 


His 


Leu 


Tyr Thr Tyr 


Thr 










365 










370 






375 


Ala 


Pro 


Ala 


Ala 


Leu 


Gly 


Ser 


Thr 


Gly 


Thr 


Val 


Ala His Leu 


Val 










380 










385 






390 


Ala 


Ser 


Gin Gly 


Ser 


Ala 


Arg 


His 


Thr 











395 

<210> 396 
<211> 301 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 333307 . 2 .orf 1 :2000FEB01 
<220> 

<221> unsure 
<222> 286 



212/228 



WO 01/62927 
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<223> unknown or other 
<400> 396 



Thr 


Gin 


Phe 


Ser Asp 


Asn 


Asn 


Asn 


Glu 


Lys 


Leu 


Ser Pro Lys 


Pro 


1 








5 










10 




15 


Gly 


Thr 


Gly 


Glu 


Pro 


Val 


Leu 


Ser 


Leu 


His 


Tyr 


Ser Thr Glu 


Gly 








20 










25 






30 


Thr 


Thr 


Thr 


Ser 


Thr 


He 


kys 


Leu 


Asn 


Phe 


Thr 


Asp Glu Trp 


Ser 










35 










40 






45 


Ser 


He 


Ala 


Ser 


Ser 


Ser 


Arg 


Gly 


He 


Gly 


Ser 


His Cys Lys 


Ser 










50 










55 






60 


Glu 


Gly 


Gin 


Glu 


Glu 


Ser 


Phe 


Val 


Pro 


Gin 


Ser 


Ser Val Gin 


Pro 








65 










70 






75 


Pro 


Glu 


Gly 


Asp 


Ser 


Glu 


Thr 


Lys 


Ala 


Pro 


Glu 


Glu Ser Ser 


Glu 










80 










85 






90 


Asp 


Val 


Thr 


Lys 


Tyr 


Gin 


Glu 


Gly 


Val 


Ser 


Ala 


Glu Asn Pro 


Val 






95 








100 






105 


Glu 


Asn 


His 


He 


Asn 


He 


Thr 


Gin 


Ser 


Asp 


Lys 


Phe Thr Ala 


Lys 










110 










115 






120 


Pro 


Leu 


Asp 


Ser 


Asn 


Ser 


Gly 


Glu 


Arg 


Asn 


Asp 


Leu Asn Leu 


Asp 








125 








130 






135 


Arg 


Ser 


Cys 


Gly Val 


Pro 


Glu 


Glu 


Ser 


Ala 


Ser 


Ser Glu Lys 


Ala 










140 










145 






150 


Lys 


Glu 


Pro 


Glu 


Thr 


Ser 


Asp 


Gin 


Thr 


Ser 


Thr 


Glu Ser Ala 


Thr 










155 








160 






165 


Asn 


Glu 


Asn 


Asn 


Thr 


Asn 


Pro 


Glu 


Pro 


Gin 


Phe 


Gin Thr Glu 


Ala 










170 










175 






180 


Thr 


Gly 


Pro 


Ser 


Ala 


His 


Glu 


Glu 


Thr 


Ser 


Thr 


Arg Asp Ser 


Ala 








185 










190 




195 


Leu 


Gin 


Asp 


Thr Asp 


Asp 


Ser 


Asp 


Asp 


Asp 


Pro 


Val Leu He 


Pro 










200 










205 






210 


Gly 


Ala 


Arg 


Tyr 


Arg 


Ala 


Gly 


Pro 


Gly 


Asp 


Arg 


Phe Asn He 


Arg 










215 










220 






225 


Gly 


Thr 


Thr 


He Gly 


Asp 


Arg 


He 


Met 


Arg 


Arg 


Ser Ala Val 


Ala 










230 










235 






240 


Arg 


He 


Gin 


Glu 


Phe 


Phe 


Arg 


Arg 


Arg 


Lys 


Glu 


Arg Lys Glu 


Met 










245 










250 






255 


Glu 


Glu 


Leu 


Asp Thr 


Leu 


Asn 


He 


Arg 


Arg 


Pro 


Leu Val Lys 


Met 










260 










265 






270 


Val 


Tyr 


Lys 


Gly His 


Arg 


Asn 


Ser 


Arg 


Thr 


Met 


He Lys Glu 


Ala 










275 










280 






285 


Xaa 


Phe 


Trp 


Gly Ala 


Asn 


Phe 


Val 


Met 


Ser 


Gly 


Ser Asp Cys 


Gly 



290 295 300 

His 



<210> 397 
<211> 105 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 814710 . 2 .orf 2 :2000FEB01 
<400> 397 



Glu Gly Pro 


Glu 


Gly 


Lys 


Trp Lys 


Ser 


Pro 


Lys 


Phe Lys Met 


Pro 


1 




5 








10 






15 


Glu Met His 


Phe 


Lys 


Thr 


Pro Lys 


He 


Ser 


Met 


Pro Asp He 


Asp 






20 








25 




30 


Leu Asn Leu 


Thr 


Gly 


Pro 


Lys He 


Lys 


Gly 


Asp 


Val Asp Val 


Thr 






35 








40 






45 


Gly Pro Lys 


Val 


Glu 


Gly 


Asp Leu 


Lys 


Gly 


Pro 


Glu Val Asp 


Leu 






50 








55 






60 


Lys Gly Pro 


Lys 


Val 


Asp 


He Asp 


Val 


Pro 


Asp 


Val Asn Val 


Gin 






65 








70 






75 


Gly Pro Asp 


Trp 


His 


Leu 


Lys Met 


Pro 


Lys 


Met 


Lys Met Pro 


Lys 
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80 




85 






on 


Phe Ser Met Pro Gly 


Phe Lys Gly Glu Gly Pro Gly Ser Arg 


Ser 








95 




100 






105 


<210> 398 
















<211> 153 
















<212> PRT 
















<213> Homo sapiens 












<220> 


















<221> misc_feature 












<223> Incyte ID 


No: 


LG: 414732 . 1 .orf 1 :2000MAY19 




<400> 398 
















Trp Tyr 


Arg 


Arg 


Leu 


Leu Arg Glu Ser 


Gly 


Ser 


Thr Met Asp 


lie 


1 






5 




10 






15 


Pro Val 


Pro 


Ser 


Ser 


Phe Asn Asp Val 


Gly 


Gin 


Asp Trp Arg 


Leu 








20 




25 






30 


Arg His 


Phe 


Val 


Asp 


Gin Met Trp Tyr 


GlU 


Arg 


Glu Val Thr 


Phe 






35 




40 






45 


Leu Glu 


Gin 


Trp 


Thr 


Gin Asp Leu His 


Thr 


Arg 


Val Val Leu 


Arg 








50 




bb 






. 60 


lie Val 


Ser 


Ala 


His 


Ser Tyr Ala lie 


Val 


Trp 


Val Asn Gly 


Val 








6b 


70 




75 


Asp Ala 


Leu 


Glu 


His 


Glu Gly Ser Thr 


Ser 


Pro 


Leu Thr Pro 


Thr 






80 


85 






90 


Ser Val 


Ala 


Cys 


Ser 


Arg Trp Gly Pro 


Cys 


Pro 


Pro Ala Ser 


Ala 






95 




100 






105 


Ser Leu 


Ser 


Pro 


Ser 


Ala Thr Cys Ser 


Ser 


Pro 


Pro Pro Cys 


His 








110 




115 






120 


Gin Gly 


Ala 


Ser 


Ser 


Thr Trp Pro Thr 


Pro 


Pro 


Arg Gly Tyr 


His 






125 




130 






135 


Pro Ala 


Ser 


Thr 


Ala 


Asp Thr His Leu 


Pro 


Val 


Pro Pro Arg 


Gxy 








140 


145 




150 


Ala Leu 


His 
















<210> 399 
















<211> 161 
















<212> PRT 
















<213> Homo sapiens 












<220> 


















<221> misc_feature 












<223> Incyte ID 


NO: 


LG:413910.6.orfl:2000MAY19 




<400> 399 
















Ser Met 


Leu 


Ala 


Ser 


Gin- Gly Val Leu 


Leu 


His 


Pro Tyr Gly 


Val 


1 






5 




10 






15 


Pro Met 


lie 


Val 


Pro 


Ala Ala Pro Tyr 


Leu 


Pro 


Gly Leu lie 


Gin 








20 




25 






30 


Gly Asn 


Gin 


Glu 


Ala 


Ala Ala Ala Pro 


Asp 


Thr 


Met Ala Gin 


Pro 








35 




40 






45 


Tyr Ala 


Ser 


Ala 


Gin 


Phe Ala Pro Pro 


Gin 


Asn 


Gly lie Pro 


Ala 






50 




55 






60 


Glu Tyr 


Thr 


Ala 


Pro 


His Pro His Pro 


Ala 


Pro 


Glu Tyr Thr 


Gly 








65 




70 






75 


Gin Thr 


Thr 


Val 


Pro 


Glu His Thr Leu 


Asn 


Leu 


Tyr Pro Pro 


Ala 








80 




85 






90 


Gin Thr 


His 


Ser 


Glu 


Gin Ser Pro Ala 


Val 


Phe 


Leu Phe Val 


He 








95 




100 






105 


Thr Arg 


Ala 


Val 


Ala 


Leu Phe Thr Ser 


lie 


Leu 


Arg Pro Ser 


Thr 








110 




115 




120 


Thr Val 


Pro 


Cys 


Asn 


Phe Ser Leu Ala 


Leu 


Ser 


Ala Ser Ala 


Leu 








125 




130 






135 


Phe Ser 


Lys 


Val 


Thr 


Lys Pro Asn Pro 


Phe 


GlU 


Pro Arg Ser 


Leu 



214/228 
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140 145 150 

Lys He He Ser Thr Ser Lys He Leu Pro Asn 
155 160 

<210> 400 
<211> 153 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: ' LI:414732 .2 .orf 1:2000MAY01 
<400> 400 



Trp 


Tyr 


Arg 


Arg 


Leu 


Leu Arg Glu Ser 


Gly 


Ser 


Thr Met Asp 


He 


1 








5 




10 






15 


Pro 


Val 


Pro 


Ser 


Ser 


Phe Asn Asp Val 


Gly 


Gin 


Asp Trp Arg 


Leu 










20 




25 






30 


Arg 


His 


Phe 


Val 


Asp 


Gin Met Trp Tyr 


GlU 


Arg 


Glu Val Thr 


Phe 










35 




40 






45 


Leu 


Glu 


Gin 


Trp 


Thr 


Gin Asp Leu His 


Thr 


Arg 


Val Val Leu 


Arg 










50 




55 






60 


He 


Val 


Ser 


Ala 


His 


Ser Tyr Ala He 


Val 


Trp 


Val Asn Gly 


Val 










65 




70 






75 


Asp 


Ala 


Leu 


Glu 


His 


Glu Gly Ser Thr 


Ser 


Pro 


Leu Thr Pro 


Thr 










80 


85 






90 


Ser 


Val 


Ala 


Cys 


Ser 


Arg Trp Gly Pro 


Cys 


Pro 


Pro Ala Ser 


Ala 










95 




100 






105 


Ser 


Leu 


Ser 


Pro 


Ser 


Ala Thr Cys Ser 


Ser 


Pro 


Pro Pro Cys 


His 










110 


115 






120 


Gin 


Gly 


Ala 


Ser 


Ser 


Thr Trp Pro Thr 


Pro 


Pro 


Arg Gly Tyr 


His 










125 




130 






135 


Pro 


Ala 


Ser 


Thr 


Ala 


Asp Thr His Leu 


Pro 


Val 


Pro Pro Arg 


Gly 










140 




145 






150 


Ala 


Leu 


His 

















<210> 401 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 900264 .2 .orf 3 :2000MAY01 
<400> 401 



Arg 


Gin 


He 


Val 


Ser 


Val 


Leu Ser Cys Arg 


Phe Asp Ser Asn 


Gin 


1 








5 




10 




15 


Leu 


Pro 


Ala 


Asn 


Ala 


Pro 


He Glu Asp Arg 


Arg Ser Ala Ala 


Thr 










20 




25 




30 


Cys 


Leu 


Gin 


Thr 


Arg 


Gly 


Leu Leu Leu Gly 


Val Phe Asp Gly 


His 










35 




40 




45 


Ala 


Gly 


Cys 


Ala 


Cys 


Ser 


Gin Ala Val Ser 


Glu Arg Leu Phe 


Tyr 










50 




55 




60 


Tyr 


He 


Ala 


Val 


Ser 


Leu 


Leu Pro His Glu 


Thr Leu Leu Glu 


He 








65 




70 




75 


Glu 


Asn 


Ala 


Val 


Glu 


Ser 


Gly Arg Ala Leu 


Leu Pro He Leu 


Gin 










80 




85 




90 


Trp 


His 


Lys 


His 


Pro 


Asn 


Asp Tyr Phe Ser 


Lys Glu Ala Ser 


Lys 










95 




100 




105 


Leu 


Tyr 


Phe 


Asn 


Ser 


Leu 


Arg Thr Tyr Trp 


Gin Gly Ala Tyr 


Arg 










110 




115 




120 


Pro 


Gin 


His 


Trp 


Val 


Ser 


Arg Leu He Leu 


Met Leu Arg Arg 


Leu 










125 




130 




135 
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<210> 402 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 335593 . 1 .orf 3 :2000MAY01 
<220> 

<221> unsure 

<222> 28, 39-40 

<223> unknown or other 

<400> 402 



Arg 


Gly 


Ala 


Gly Thr 


Ala Ala Leu Pro 


Ser 


Arg 


Leu Leu Pro Ser 


1 






5 




10 




15 


Ser 


Ala 


Ala 


Arg Ser 


Ser Leu Gly He 


His 


Leu 


Leu Xaa Leu Leu 








20 




25 




30 


Leu 


Leu 


He 


His Ser 


Phe Pro His Xaa 


Xaa 


Leu 


Leu Leu Gly Phe 








35 




40 




45 


Ser 


Pro 


Arg 


Pro Ala 


Ser Pro Arg Ala 


Leu 


Pro 


Leu Pro Leu Pro 








50 




55 




60 


Val 


Leu 


Pro 


Gly Pro 


Leu Leu Pro Leu 


He 


His 


Ser Pro Leu Ser 








65 




70 




75 


Leu 


Leu 


His 


Ser Leu 


Pro Leu Ser Pro 


Phe 


Phe 


Phe Phe Phe His 








80 




85 




90 


Pro 


Pro 


Ser 


Leu Thr 


Pro Pro Pro Phe 


Pro 


Cys 


Leu Leu Ser Asp 








95 




100 


105 


Thr 


Ala 


Leu 


Gin Leu 


Leu Leu Ser Pro 


Ala 


Pro 


Ser Pro Val Arg 








110 




115 




120 


Thr 


Asn 


Gin 


Gin His 


Cys Phe Phe Ser 














125 









<210> 403 
<211> 299 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 1189543 . 1 .orf 1 : 2000MAY01 
<400> 403 



GlU 


Phe 


Arg 


Gin 


Asn 


Lys 


Arg 


Glu 


Asn 


Leu 


Leu 


Pro Val Ala Ala 


1 








5 










10 




15 


Ala 


Gly 


Thr 


Ala 


Asn 


Met 


Met 


Ala 


Ala 


Ala 


Pro 


He Gin Gin Asn 










20 










25 




30 


Gly 


Thr 


His 


Thr 


Gly 


Val 


Pro 


He 


Asp 


Leu 


Asp 


Pro Pro Asp Ser 










35 










40 




45 


Arg 


Lys 


Arg 


Pro 


Leu 


Glu 


Ala 


Pro 


Pro 


Glu 


Ala 


Gly Ser Thr Lys 










50 










55 




60 


Arg 


Thr 


Asn 


Thr 


Gly 


Glu 


Asp 


Gly 


Gin 


Tyr 


Phe 


Leu Lys Val Leu 










65 










70 




75 


He 


Pro 


Ser 


Tyr 


Ala 


Ala 


Gly 


Ser 


He 


He 


Gly 


Lys Gly Gly Gin 










80 










85 




90 


Thr 


He 


Val 


Gin 


Leu 


Gin 


Lys 


Glu 


Thr 


Gly 


Ala 


Thr He Lys Leu 










95 










100 




105 


Ser 


Lys 


Leu 


Ser 


Lys 


Ser 


Lys 


Asp 


Phe 


Tyr 


Pro 


Gly Thr Thr Glu 










110 










115 




120 


Arg 


Val 


Cys 


Leu 


He 


Gin 


Gly 


Thr 


Val 


Glu 


Ala 


Leu Asn Ala Val 










125 








130 




135 


His 


Gly 


Phe 


He 


Ala 


Glu 


Lys 


He 


Arg 


Glu 


Met 


Pro Gin Asn Val 










140 








145 




150 


Ala 


Lys 


Thr 


Glu 


Pro 


Val 


Ser 


He 


Leu 


Gin 


Pro 


Gin Thr Thr Val 










155 










16.0 




165 


Asn 


Pro 


Asp 


Arg 


He 


Lys 


Gin 


Thr 


Leu 


Pro 


Ser 


Ser Pro Thr Thr 
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170 




175 






180 


Thr 


Lys 


Ser Ser 


Pro 


Ser Asp Pro Met 


Thr 


Thr 


Ser Arg Ala 


Asn 






185 


190 




195 


Gin 


Val 


Lys lie 


He 


Val Pro Asn Ser 


Thr 


Ala 


Gly Leu He 


He 






200 




205 






210 


Gly 


Lys 


Gly Gly 


Ala 


Thr Val Lys Ala 


Val 


Met 


Glu Gin Ser 


Gly 








215 




220 






225 


Ala 


Trp 


Val Gin 


Leu 


Ser Gin Lys Pro 


Asp 


Gly 


He Asn Leu 


Gin 








23 0 




235 




240 


Glu 


Arg 


Val Val 


Thr 


Val Ser Gly Glu 


Pro 


Glu 


Gin Asn Arg 


Lys 








245 




250 






255 


Ala 


Val 


Glu Leu 


He 


He Gin Lys He 


Gin 


Glu 


Asp Pro Gin 


Ser 








260 


265 






270 


Gly 


Ser 


Cys Leu 


Asn 


He Ser Tyr Ala 


Asn 


Val 


Thr Gly Pro 


Val 






275 




280 




285 


Gly 


Lys 


Phe Gin 


Ser 


Asn Arg He Ser 


Leu 


Cys 


Lys His Cys 










290 




295 









<210> 404 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: LG:455450. 1 .orf 1 :2000FEB18 
<400> 404 



Gly 


Gly 


Gly He 


Leu 


Arg 


Arg 


Ser Gly 


Ser 


Ser 


Ser Ser Ser 


Ser 


1 






5 








10 






15 


Ser 


Ser 


Ser Ser 


Glu 
20 


Asp 


Asp 


Gly Met 


Gly 
25 


Gly 


Arg Arg Lys 


Lys 
30 


Gly 


Leu 


Lys Glu 


Lys 
35 


He 


Lys 


Glu Lys 


Met 
40 


Pro 


Gly Gly His 


Arg 
45 


Glu 


Gly 


Gin Gly 


Gin 
50 


Ala 


Thr 


Ala Thr 


Gly 
55 


Ala 


Tyr Gly Gly 


Thr 
60 


Gly 


Tyr 


Val Ala 


Gly 
65 


Pro 


Thr 


Thr Gly 


Gly 
70 


Pro 


His Glu Lys 


Lys 
75 


Gly 


Val 


Val Glu 


Lys 
80 


He 


Lys 


Glu Lys 


He 
85 


Pro 


Gly Gly His 


Lys 
90 


Asp 


Tyr 


Asp Gin 


His 
95 


Gin 


His 


Thr Thr 


Ala 
100 


Ala 


Thr Gly Gly 


Gly 
105 


Gly 


Gly 


Tyr Gly 


Gly 
110 


Thr 


Thr 


Asp Thr 


Thr 
115 


Tyr 


Gly Thr Thr 


Thr 
120 


Thr 


Glu 


Gly Thr 


His 
125 


Glu 


Lys 


Lys Gly 


Phe 
130 


Met 


Asp Lys He 


Lys 
135 


Glu 


Lys 


Leu Pro 


Gly 


Gin. 


His 
















140 

















<210> 405 
<211> 168 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1040978 . 1 .orf 2 :2000FEB18 
<400> 405 

Ser Tyr Leu Arg Ser Arg Gly Gin Pro Pro Pro Arg Arg Ser His 

15 10 15 

Ala Leu Arg Ala Arg Arg Leu Ser Ser Val Ser Ala Ser Leu Pro 

20 25 30 
Leu Pro Ser Arg Leu Thr His Met Ala Ser He Ala Gly Ser Ser 

35 40 45 

Ala Leu Ser Phe Ala Arg Pro Val Lys Ala He Asn Thr Asn Ser 

50 ~ 55 60 
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Leu 


Ala 


Phe 


Ser 


Pro 


Ala 


Arg 


Lys Gly 


Asn 


Thr 


Phe Leu Arg Leu 










65 






70 




75 


Gin 


Pro 


Met 


Pro 


Met 


Arg 


Ser 


Val Ser 


Cys 


Ala 


Ala Lys Lys Asp 










80 








85 




90 


Thr 


Thr 


Asp 


Lys 


Val 


Cys 


Glu 


lie Val 


Lys 


Lys 


Gin Leu Ala Leu 










95 








100 




105 


Pro 


Asp 


His 


Thr 


Glu 


Val 


Cys 


Gly Glu 


Ser 


Lys 


Phe Ser Glu Leu 










110 








115 




120 


Gly 


Ala 


Asp 


Ser 


Leu 


Asp 


Thr 


Val Glu 


lie 


Val 


Met Ser Leu Glu 










125 








130 




135 


Glu 


His 


Phe 


Asp 


lie 


Ser 


Val 


Glu Glu 


Ser 


Ser 


Ala Gin Thr He 










140 








145 




150 


Ala 


Thr 


Val 


Glu 


Asp 


Ala 


Ala 


Asp Leu 


He 


Asp 


Lys Leu Val Ala 



155 160 165 

Gly Lys Ala 



<210> 406 
<211> 117 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:446649 . 1 .orf 2 :2000FEB18 
<400> 406 



Leu Lys 


Ala 


Arg Glu 


Gin Ala Gin Lys 


Arg 


Glu 


Ala He Gin Val 


1 




5 


10 




15 


Thr Ser 


Pro 


Val Cys 


Leu Arg Leu He 


Leu 


Arg 


Lys Ala Gly Glu 






20 




25 




30 


Glu Val 


Lys 


Arg Leu 


Lys Thr Gin Pro 


Thr 


Asp 


Glu Glu Met Leu 






35 




40 




45 


Phe He 


Tyr 


Ser His 


Phe Lys Gin Ala 


Thr 


Val 


Gly Asp Val Asn 






50 


55 




60 


Thr Asp 


Arg 


Pro Gly 


Leu Leu Asp Leu 


Lys 


Gly 


Lys Ala Lys Trp 






65 




70 




75 


Asp Ser 


Trp 


Asn Lys 


Leu Lys Gly Thr 


Ser 


Lys 


Glu Asn Ala Met 






80 




85 




90 


Lys Thr 


Tyr 


Val Glu 


Lys Val Glu Glu 


Leu 


Lys 


Lys Lys Tyr Gly 






95 




100 




105 


He Leu 


Thr 


Thr Arg 


Phe Gly Gly Gin 


Pro 


His 


Val 






110 


115 







<210> 407 
<211> 804 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 132147 . 3 .orf 3 :2000FEB18 
<400> 407 



Ala Ser 


Ser 


Ala 


Ala 


Gin 


Asp 


Asp Tyr 


Ala 


Arg 


His Val Arg Asp 


1 






5 








10 




15 


Pro Pro 


Ala 


Val 


Ala 


Val 


Ala 


Ala Pro 


Lys 


Ala 


Arg Ala Ser Ala 








20 








25 




30 


Pro Asp 


Ala 


Val 


Val 


Thr 


Pro 


Pro Arg 


Leu 


Ala 


Val Ala Pro Val 








35 








40 




45 


Pro Val 


Leu 


Pro 


Ala 


Ala 


Ser 


Ala Ala 


Gly 


Asp 


Met Ser Asn Pro 








50 








55 


60 


Gly Ser 


Arg 


Arg 


Asn 


Gly 


Pro 


Val Lys 


Leu 


His 


Leu Thr Val Leu 








65 






70 




75 


Cys Ala 


Lys 


Asn 


Leu 


Val 


Lys 


Lys Asp 


Phe 


Phe 


Arg Leu Pro Asp 








80 








85 




90 


Pro Phe 


Ala 


Lys 


Val 


Val 


Val 


Asp Gly 


Ser 


Gly 


Gin Cys His Ser 



218/228 



WO 01/62927 



PCT/US01/06059 



Thr 


Asp 


Thr 


Tyr 


Asp 


Leu 


Trp 


Asn 


His 


Gly 


Cys 


Val 


Thr 


Gly 


Tyr 


Asn 


Asp 


Thr 


Asp 


Arg 


He 


Phe 


Asp 


Asn 


Ser 


Gly 


Arg 


Trp 


Glu 


Arg 


Arg 


Pro 


Leu 


Thr 


Asn 


Gly 


GlU 


Arg 


Arg 


Thr 


His 


Leu 


Arg 


Thr 


Thr 


Gly 


Val 


Ser 


Asn 


He 


Asn 


lie 


Arg 


Asn 


Asn 


Arg 


Thr 


His 


Leu 


Val 


Gin 


Gin 


Val 


Val 


Pro 


Arg 


Arg 


Gin 


Glu 


He 


Glu 


Val 


Val 


Met 


Lys 


Lys 


Phe 


Arg 


Glu 


Trp 


Leu 


Gly 


Leu 


Phe 


Asn 


Pro 


Asp 


Phe 


Val 


Gly 


He 


Asp 


Gly 


Lys 


Ser 


He 


His 


Asn 


Ser 





95 




Val 


Lys 
110 


Asn 


Tyr 


He 
125 


Gly 


Lys 


Lys 
140 


He 


Arg 


Leu 
155 


Leu 


Gin 


Arg 
170 


Leu 


Val 


Arg 
185 


Gly 


Gly 


Thr 
200 


Gly 


Asp 


Leu 
215 


Pro 


He 


Gin 
230 


Tyr 


Pro 


Thr 
245 


Arg 


Ser 


Cys 
260 


Phe 


Ala 


Thr 

275 


Cys 


Val 


Arg 
290 


Ser 


His 


Thr 
305 


Pro 


Gin 


Gin 
320 


Gly 


Thr 


Trp 
335 


His 


Cys 


Glu 
350 


Glu 


Thr 


Ala 
365 


Thr 


Thr 


Gin 
380 


Phe 


Leu 


Asn 
395 


Arg 


Val 


Ser 
410 


Leu 


Tyr 


Lys 
425 


Arg 


Leu 


Ser 
440 


Gin 


Ser 


Arg 
455 


Glu 


Met 


Arg 
470 


Pro 


Gly 


Glu 
485 


Glu 


Tyr 


Leu 
500 


Leu 


Gin 


Tyr 
515 


Ser 


Ser 


Ala 
530 


Val 


Arg 


He 
545 


Met 


Gly 


Phe 
560 


Thr 


Thr 


Leu 
575 


Asp 


Leu 


Val 


Trp 




590 



Thr 


Leu 


Asp 


Lys 


Ser 


Asp 


His 


Lys 


Lys 


Ser 


Asn 


Ala 


Asp 


Leu 


Cys 


Gin 


He 


Val 


Gly 


Gin 


Val 


Asp 


Gly 


Trp 


Leu 


Asn 


His 


Pro 


Ala 


Ser 


Val 


Asp 


Glu 


Gly 


Gin 


Ser 


Gin 


Arg 


His 


Pro 


Asp 


Leu 


Gin 


Val 


Tyr 


Asp 


Pro 


Arg 


Leu 


Gly 


Pro 


Gly 


Arg 


Val 


Thr 


Asp 


Pro 


Gin 


Asn 


Gin 


Cys 


Pro 


Asp 


Asp 


Leu 


Val 


Gin 


Gin 


Pro 


GlU 


He 


Phe 


Lys 


Asp 


Leu 


Gly 


Leu 


Asp 


Ser 


His 


Glu 


Arg 


Asp 


Asp 


Asn 


Pro 


Glu 


Gly 


Met 


Ala 


Leu 


Pro 


Phe 


Asp 


Met 


Glu 


He 


Leu 


Glu 



100 






Pro 


Lys 


Trp 


115 






Ser 


Val 


Thr 


130 






Gin 


Gly 


Ala 


145 






He 


Asn 


Arg 


160 






Lys 


Leu 


Gly 


175 






Val 


Ser 


Leu 


190 






Val 


Asp 


Cys 


205 






Glu 


Glu 


Arg 


220 






He 


Thr 


Arg 


235 






Glu 


Tyr 


Ser 


250 






Asn 


Thr 


Pro 


265 






Ser 


Asp 


Pro 


280 






Arg 


Asn 


Tyr 


295 






Pro 


Glu 


Gly 


310 






Phe 


Leu 


His 


325 






Val 


Pro 


Arg 


340 






Leu 


Pro 


Pro 


355 






Tyr 


Phe 


Val 


370 






Arg 


Leu 


Ser 


385 






Leu 


Lys 


Asp 


400 






Asp 


Thr 


Glu 


415 






Gin 


Lys 


Leu 


430 






Gin 


Ala 


Gly 


445 




Glu 


Glu 


Ser 


460 






Trp 


Lys 


Arg 


475 






Tyr 


Gly 


Gly 


490 






Met 


Leu 


Asn 


505 






He 


Tyr 


Thr 


520 






His 


Leu 


Ser 


535 






Val 


Phe 


His 


550 






Tyr 


Lys 


Gin 


565 






Leu 


Val 


Asp 


580 






Asn 


Asp 


He 


595 











105 


Asn 


Gin 


His 
120 


He 


Ser 


Val 
135 


Gly 


Phe 


Leu 




150 


Leu 


Lys 


Asp 
165 


Pro 


Asn 


Asp 
180 


Gin 


Ser 


Arg 
195 


Ser 


Arg 


Leu 
210 


Arg 


Thr 


Ala 
225 


Thr 


Thr 


Gin 
240 


Ser 


Pro 


Gly 
255 


He 


Ser 


Gly 
270 


Arg 


Leu 


Ala 
285 


Met 


Ser 


Arg 
300 


Tyr 


Glu 


Gin 




315 


Thr 


Gin 


Thr 
330 


Asp 


Leu 


Ser 
345 


Gly 


Trp 


Glu 
360 


Asp 


His 


Asn 
375 


Ala 


Asn 


Leu 
390 


Gin 


Gin 


Gin 
405 


Cys 


Leu 


Thr 
420 


Lys 


He 


Leu 
435 


His 


Cys 


Arg 
450 


Tyr 


Arg 


Gin 
465 


Leu 


Met 


He 
480 


Val 


Ala 


Arg 
495 


Pro 


Tyr 


Tyr 
510 


Leu 


Gin 


He 
525 


Tyr 


Phe 


His 
540 


Gly 


His 


Tyr 
555 


Leu 


Leu 


Gly 
570 


Pro 


Asp 


Leu 
585 


Thr 


Gly 


Val 




600 



219/228 



WO 01/62927 



PCT/US01/06059 



Leu 


Asp 


His Thr 


Phe 


Cys 


Val 


Glu His 


Asn 


Ala 


Tyr Gly Glu 


He 






605 








610 






615 


He 


Gin 


His Glu 


Leu 


Lys 


Pro 


Asn Gly 


Lys 


Ser 


He Pro Val 


Asn 








620 








625 






630 


Glu 


Glu 


Asn Lys 


Lys 


Glu 


Tyr 


Val Arg 


Leu 


Tyr 


Val Asn Trp 


Arg 








635 










640 






645 


Phe 


Leu 


Arg Gly 


He 


Glu 


Ala 


Gin 


Phe 


Leu 


Ala 


Leu Gin Lys 


Gly 






650 










655 






660 


Phe 


Asn 


Glu Val 


He 


Pro 


Gin 


His 


Leu 


Leu 


Lys 


Thr Phe Asp 


Glu 








665 










670 




675 


Lys 


Glu 


Leu Glu 


Leu 


He 


He 


Cys Gly 


Leu 


Gly 


Lys He Asp 


Val 








680 










685 






690 


Asn 


Asp 


Trp Lys 


Val 
695 


Asn 


Thr 


Arg 


Leu 


Lys 
700 


His 


Cys Thr Pro 


Asp 
705 


Ser 


Asn 


He Val 


Lys 
710 


Trp 


Phe 


Trp 


Lys 


Ala 
715 


Val 


Glu Phe Phe 


Asp 
720 


Glu 


Glu 


Arg Arg 


Ala 
725 


Arg 


Leu 


Leu 


Gin 


Phe 
730 


Val 


Thr Gly Ser 


Ser 
735 


Arg 


Val 


Pro Leu 


Gin 
740 


Gly 


Phe 


Lys 


Ala 


Leu 
745 


Gin 


Gly Ala Ala 


Gly 
750 


Pro 


Arg 


Leu Phe 


Thr 
755 


He 


His 


Gin 


He 


Asp 
760 


Ala 


Cys Thr Asn 


Asn 
765 


Leu 


Pro 


Lys Ala 


His 
770 


Thr 


Cys 


Phe 


Asn 


Arg 
775 


He 


Asp He Pro 


Pro 
780 


Tyr 


Glu 


Ser Tyr 


Glu 
785 


Lys 


Leu 


Tyr 


Glu 


Lys 
790 


Leu 


Leu Thr Ala 


He 
795 


Glu 


Glu 


Thr Cys 


Gly 
800 


Phe 


Ala 


Val 


Glu 











<210> 408 
<211> 220 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI.-036034. 1 .orf 1:2000FEB01 
<400> 408 



Thr 


He 


His 


Leu 


Lys 


Thr 


Leu 


lie He 


Val 


Trp Lys Arg Tyr 


Ser 


1 








5 








10 




15 


Asp 


Phe 


Lys 


Lys 


Leu 


His 


Lys 


Glu Leu 


Trp 


Gin He His Lys 


Asn 










20 








25 




30 


Leu 


Phe 


Arg 


His 


Ser 


Glu 


Leu 


Phe Pro 


Pro 


Phe Ala Lys Gly 


He 










35 








40 




45 


Val 


Phe 


Gly 


Arg 


Phe 


Asp 


Glu 


Thr Val 


He 


Glu Glu Arg Arg 


Gin 










50 








55 




60 


Tyr 


Ala 


Glu 


Asp 


Leu 


Leu 


Gin 


Phe Ser 


Ala 


Asn He Pro Ala 


Leu 










65 








70 




75 


Tyr 


Asn 


Ser 


Lys 


Gin 


Leu 


Glu 


Asp Phe 


Phe 


Lys Gly Gly He 


He 










80 








85 




90 


Asn 


Asp 


Ser 


Ser 


Glu 


Leu 


He 


Gly Pro 


Ala 


Glu Ala His Ser 


Asp 










95 






100 




105 


Ser 


Leu 


He 


Asp 


Thr 


Phe 


Pro 


Glu Cys 


Ser 


Thr Glu Gly Phe 


Ser 










110 






115 


120 


Ser 


Asp 


Ser 


Asp 


Leu 


Val 


Ser 


Leu Thr 


Val 


Asp Val Asp Ser 


Leu 










125 








130 


135 


Ala 


Glu 


Leu 


Asp 


Asp 


Gly 


Met 


Ala Ser 


Asn 


Gin Asn Ser Pro 


He 










140 








145 




150 


Arg 


Thr 


Phe 


Gly 


Leu 


Asn 


Leu 


Ser Ser 


Asp 


Ser Ser Ala Leu 


Gly 










155 








160 




165 


Ala 


Val 


Ala 


Ser 


Asp 


Ser 


Glu 


Gin Ser 


Lys 


Thr Glu Glu Glu 


Arg 










170 








175 




180 


Glu 


Ser 


Arg 


Ser 


Leu 


Phe 


Pro 


Gly Ser 


Leu 


Lys Pro Lys Leu 


Gly 










185 








190 




195 


Lys 


Arg 


Asp 


Tyr 


Leu 


Glu 


Lys 


Ala Gly 


Glu 


Leu He Lys Leu 


Ala 










200 




205 




210 
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Leu Lys Lys Glu Glu Glu Asp Asp Tyr Glu 
215 220 

<210> 409 
<211> 168 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 162161 . 1 .orf 2 :2000MAY19 
<400> 409 



Gin 


Asp 


His 


Gin 


Phe 


Ala 


Pro 


Gin 


Ser 


Thr 


Met 


Ser 


Arg Ser Arg 


1 








5 










10 






15 


Gin 


Pro 


Pro 


Leu 


Val 
20 


Thr 


Gly 


He 


Ser 


Pro 
25 


Asn 


Glu 


Gly He Pro 
30 


Trp 


Thr 


Lys 


Val 


Thr 


He 


Arg 


Gly 


Glu 


Asn 


Leu 


Gly 


Thr Gly Pro 








35 










40 






45 


Thr 


Asp 


Leu 


He 


Gly 
50 


Leu 


Thr 


He 


Cys 


Gly 
55 


His 


Asn 


Cys Leu Leu 
60 


Thr 


Ala 


Glu 


Trp 


Met 
65 


Ser 


Ala 


Ser 


Lys 


He 
70 


Val 


Cys 


Arg Val Gly 
75 


Gin 


Ala 


Lys 


Asn 


Asp 
80 


Lys 


Gly 


Asp 


He 


He 
85 


Val 


Thr 


Thr Lys Ser 
90 


Gly 


Gly 


Arg 


Gly 


Thr 
95 


Ser 


Thr 


Val 


Ser 


Phe 
100 


Lys 


Leu 


Leu Lys Pro 
105 


Glu 


Lys 


He 


Gly 


He 
110 


Leu 


Asp 


Gin 


Set 


Ala 
115 


Val 


Trp 


Val Asp Glu 
120 


Met 


Asn 


Tyr 


Tyr 


Asp 
125 


Met 


Arg 


Thr 


Asp 


Arg 
130 


Asn 


Lys 


Gly He Pro 
135 


Pro 


Leu 


Ser 


Leu 


Arg 


Pro 


Ala 


Asn 


Pro 


Leu 


Gly 


Met 


Glu He Glu 










140 










145 




150 


Pro 


Ser 


Thr 


Phe 


Ser 
155 


Gin 


Lys 


Asp 


Leu 


Glu 
160 


Met 


Leu 


Phe His Gly 
165 



Met Ser Ala 



<210> 410 
<211> 108 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc„feature 

<223> Incyte ID No: LG:407214 . 10 .orf 2 :2000MAY19 
<400> 410 



Lys 


Ser 


Arg Met Asp 


Leu Asp Val Val 


Asn 


Met Phe 


Val He Ala 


1 




5 




10 




15 


Gly 


Gly 


Thr Leu Ala 


He Pro He Leu 


Ala 


Phe Val 


Ala Ser Phe 






20 




25 




30 


Leu 


Leu 


Trp Pro Ser 


Ala Leu He Arg 


He 


Tyr Tyr 


Trp Tyr Trp 






35 




40 




45 


Arg 


Arg 


Thr Leu Gly 


Met Gin Val Arg 


Tyr 


Val His 


His Glu Asp 






50 




55 




60 


Tyr 


Gin 


Phe cys Tyr 


Ser Phe Arg Gly 


Arg 


Pro Gly 


His Lys Pro 






65 




70 




75 


Ser 


He 


Leu Met Leu 


His Gly Phe ser 


Gly 


His Lys 


Asp Met Trp 






80 




85 




90 


Leu 


Ser 


Val Val Lys 


Val Pro Ser Lys 


Glu 


Pro Ala 


Leu Gly Leu 






95 




100 




105 


Arg 


Gly 


His 











<210> 411 
<211> 314 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 204626 . 1 . orf 1 : 2000MAY19 
<400> 411 



Gly Val 


Ala 


Arg 


Lys 


Lys 


Lys He Val He 


Lys 


Glu 


Glu Pro 


Lys 


1 






5 




10 








15 


Arg Arg 


Lys 


Gly Lys 


Met 


Lys Asp Arg Leu 


Gin 


Glu 


Leu Lys 


Gin 








20 




25 








30 


Arg Thr 


Lys 


Glu 


He 


Glu 


Leu Ser Arg Asp 


Ser 


His 


Val Ser 


Thr 








35 




40 








45 


Thr Glu 


Thr 


Glu 


Glu 


Gin 


Gly Val Phe Leu 


Gin 


Gin 


Ala Val 


He 








50 




55 








60 


Tyr Glu 


Arg 


Glu 


Pro 


Val 


Ala Glu Arg His 


Leu 


His 


Glu He 


Gin 








65 




70 








75 


Lys Leu 


Gin 


Glu 


Ser 


He 


Asn Asn Leu Ala 


Asp 


Asn 


Val Gin Lys 








80 




85 








90 


Phe Gly 


Gin 


Gin 


Gin 


Lys 


Ser Leu Val Ala 


Ser 


Met 


Arg Arg 


Phe 








95 


100 








105 


Ser Leu 


Leu 


Lys Arg 


Glu 


Ser Thr He Thr 


Lys 


Glu 


He Lys 


He 








110 




115 






120 


Gin Ala 


Glu 


Tyr 


He 


Asn 


Arg Ser Leu Asn 


Asp 


Leu 


Val Lys 


Glu 








125 




130 








135 


Val Lys 


Lys 


Ser 


Glu 


Val 


Glu Asn Gly Pro 


Ser 


Ser 


Val Val 


Thr 








140 




145 








150 


Arg He 


Leu 


Lys 


Ser 


Gin 


His Ala Ala Met 


Phe 


Arg 


His Phe 


Gin 








155 




160 








165 


Gin He 


Met 


Phe 


He 


Tyr 


Asn Asp Thr He 


Ala 


Ala 


Lys Gin 


Glu 








170 




175 








180 


Lys Cys 


Lys 


Thr 


Phe 


He 


Leu Arg Gin Leu 


Glu 


Val 


Ala Gly Lys 








185 




190 








195 


Glu Met 


Ser 


Glu 


Glu 


Asp 


Val Asn Asp Met 


Leu 


His 


Gin Gly Lys 








200 




205 








210 


Trp Glu 


Val 


Phe 


Asn 


Glu 


Ser Leu Leu Thr 


Glu 


He 


Asn He 


Thr 








215 




220 








225 


Lys Ala 


Gin 


Leu 


Ser 


Glu 


He Glu Gin Arg 


His 


Lys 


Glu Leu 


Val 








230 




235 








240 


Asn Leu 


Glu 


Asn 


Gin 


He 


Lys Asp Leu Arg 


Asp 


Leu 


Phe He 


Gin 








245 




250 








255 


He Ser 


Leu 


Leu 


Val 


Glu 


Glu Gin Gly Glu 


Ser 


He 


Asn Asn 


He 








260 




265 








270 


Glu Met 


Thr 


Val 


Asn 


Ser 


Thr Lys Glu Tyr 


Val 


Asn 


Asn Thr 


Lys 








275 




280 








285 


Glu Lys 


Phe 


Gly Leu 


Ala 


Val Lys Tyr Lys 


Lys 


Arg 


Asn Pro 


Cys 








290 




295 








300 


Arg Val 


Leu 


Cys 


Cys 


Trp 


Cys Cys Pro Cys 


Cys 


Ser 


Ser Lys 










305 




310 











<210> 412 
<211> 143 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 007401 . 1 .orf 2 : 20OOMAY01 
<400> 412 

Met Ala Ser Glu Ser Asp Thr Glu Glu Phe Tyr Asp Ala Pro Glu 
15 10 15 

Asp Val His Leu Gly Gly Gly Tyr Pro Val Gly Ser Pro Gly Lys 

20 25 30 

Val Gly Leu Ser Thr Phe Lys Glu Thr Glu Asn Thr Ala Tyr Lys 

35 40 45 
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Val Gly 


Asn Glu 


Ser 


Pro 


Val 


Gin 


Glu 


Leu 


Lys 


Gin Asp Val 


Ser 






50 










55 






60 


Asn Lys 


He He 


Glu 


Ser 


He 


He 


Glu 


Glu 


Ser 


Gin Lys Val 


Leu 






65 










70 




75 


Gin Leu 


Glu Asp 


Asp 


Ser 


Leu 


Asp 


Ser 


Thr 


Gly 


Lys Glu Leu 


Ser 






80 










85 






90 


Asp Gin 


Ala Thr 


Ala 


Ser 


Pro 


He 


Val 


Ala 


Arg 


Thr Asp Leu 


Ser 






95 










100 


105 


Asn He 


Pro Gly 


Leu 


Leu 


Ala 


He 


Asp 


Gin 


Val 


Leu Pro Glu 


Glu 






110 










115 






120 


Ser Gin 


Lys Ala 


Glu 


Ser 


Gin 


Asn 


Thr 


Phe 


Glu 


Glu Thr Glu 


Leu 






125 










130 






135 


Glu Phe 


Lys Lys 


Met 


Leu 


Ser 


Phe 














140 



















<210> 413 
<211> 122 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:476342 .l.orf 1:2000MAY01 
<400> 413 



Gly Arg 


Thr Arg 


Val 


Ser 


Gly 


Pro 


Val 


Arg 


Asn 


Asn Thr Asp 


Pro 


1 




5 










10 






15 


He His 


Ala Gly 


Arg 


Arg 


Trp 


Ser 


Ser 


Ser 


Ser 


Pro Gly Arg 


Pro 






20 










25 






30 


Cys Ala 


Arg Ser 


Ser 


Trp 


Arg 


Cys 


Ser 


Cys 


Sfer 


His Thr Thr 


Thr 






35 










40 






45 


Ala Gly 


Arg Arg 


Arg 


Arg 


Trp 


Trp 


Arg 


Arg 


Pro 


Gly Cys Ala 


Trp 






50 










55 






60 


Ala Arg 


Ala Ser 


Thr 


Thr 


Arg 


Ser 


Pro 


Ala 


Ser 


Pro Thr Ala 


Ser 






65 










70 






75 


Ala Ala 


Thr Ser 


Ala 


Ser 


Arg 


Arg 


Thr 


Ala 


Gly 


Gly Pro Pro 


Ala 






80 








85 




90 


Thr Ala 


Thr Ser 


Ala 


Thr 


Ala 


Gly 


Ala 


Arg 


Arg 


Arg Ala Lys 


Gin 






95 










100 






105 


Ser Ser 


Ser Asn 


Thr 


Leu 


Gly 


Leu 


Pro 


Glu 


Leu 


Asn ser Ser 


Ser 






110 








115 






120 



Thr Lys 



<210> 414 

<211> 86 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1072759 .1 .or f2 :2000MAY01 
<400> 414 



Arg Pro 


Glu 


Glu 


Asp 


Met 


Arg Gin 


Leu 


Val 


His 


Gly Ser Gin 


Arg 


1 






5 








10 






15 


Gly Thr 


Leu 


Arg 


Lys 


Met 


Gly Leu 


Gin 


Pro 


Arg 


His Ser Ser 


Leu 








20 






25 






30 


Trp Cys 


Gin 


Phe 


Leu 


Val 


Gly Met 


Val 


Thr 


Thr 


Phe Trp Lys 


Gin 








35 






40 




45 


Gly Ala 


lie 


He 


Ala 
50 


Leu 


Val Ser 


Arg 


Arg 
55 


Trp 


Lys Val Thr 


Arg 
60 


Lys Gly 


Trp 


Gin 


Cys 
65 


Gin 


Val Arg 


Thr 


Thr 
70 


Leu 


Ala Cys Arg 


Leu 
75 


Leu Asp 


Cys 


lie 


Leu 
80 


Pro 


Pro Asn 


Ser 


Tyr 
85 


Asn 
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<210> 415 
<211> 213 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 998857 . 1 .orf 3 :2000FEB18 
<400> 415 



Arg 


Gin 


Trp 


Gin 


Leu 


Leu 


Cys 


Cys Pro 


Cys 


Cys 


Cys Gly Gly 


Cys 


1 








5 








10 






15 


Trp 


Ala 


Arg 


Val 


Glu 


Val 


Gin 


Gly Trp 


Gly 


His 


Gly Leu Ala 


Pro 










20 








25 






30 


Gly 


Leu 


Gly 


Trp 


Ser 


Cys 


Pro 


Ala Gly 


Pro 


Gly 


Gly His Arg 


Gly 








35 








40 






45 


Cys 


Gly 


Cys 


Pro 


Gly 


Val 


Gly 


Arg Thr 


Trp 


Lys 


Ala Ala Arg 


Ser 










50 








55 






60 


Arg 


Gly 


Ser 


Trp 


Val 


Ser 


Trp 


Val Ser 


Trp 


Gly 


Thr Trp Arg 


Ser 










65 








70 






75 


Trp 


Arg 


Ser 


Trp 


Gly 


Ser 


Trp 


Gly Ser 


Trp 


Asp 


Ser Arg Ala 


Pro 








80 








85 






90 


Pro 


Pro 


Glu 


Pro 


Ser 


Arg 


Thr 


Pro Arg 


Gly 


Thr 


Asn Arg Ser 


Arg 










95 








100 






105 


Thr 


Ser 


Pro 


pro 


Ala 


cys 


Arg 


Arg Ala 


Ser 


Arg 


Ser Gly Pro 


Arg 










110 








115 






120 


Ala 


Pro 


Thr 


Pro 


Pro 


Pro 


Arg 


Pro Gin 


Arg 


Arg 


Ser Ala Ala 


Ala 










125 








130 






135 


Cys 


Gly 


Trp 


Arg 


Pro 


Arg 


Ser 


Gly Arg 


Pro 


Gly 


Arg Gly Leu 


Ser 










140 








145 






150 


Trp 


Arg 


Pro 


Arg 


Arg 


Arg 


Pro 


Gly Ala 


Arg 


Gin 


Arg Ser Trp 


Ser 










155 








160 






165 


Ala 


Ser 


Phe 


Gly 


Arg 


Cys 


Arg 


Cys Arg 


Ala 


Arg 


Arg Pro Thr 


Ala 










170 








175 






180 


Cys 


Gly 


Ala 


Gly 


Ser 


Asp 


Gly 


Leu Ala 


Leu 


Ala 


Gly Thr Arg 


Ala 








185 








190 






195 


Ser 


Arg 


Arg 


Ala 


Arg 


Cys 


Arg 


Arg Ser 


Arg 


Ser 


Arg Ala Gly 


Cys 



200 205 210 

Gly Gly Ser 



<210> 416 
<211> 263 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:482261. l.orf 1 :2000FEB18 
<400> 416 



Leu 


Leu 


Asn 


Phe 


Val 


Ser 


Leu 


Ser Leu 


Phe 


Leu 


Phe Ser Phe 


Pro 


1 








5 








10 






15 


Arg 


Leu 


His 


Arg 


Glu 
20 


Gly 


Glu 


Ser Leu 


Arg 
25 


Val 


Val Tyr Gin 


Ala 
30 


Gly 


Ser 


Pro 


Thr 


Ser 


Leu 


Ala 


Pro Asn 


Thr 


Val 


Ser Ser Asn 


Pro 








35 








40 






45 


Gly 


Glu 


Val 


Thr 


Pro 
50 


Glu 


Arg 


Gly Arg 


Cys 
55 


Glu 


Glu Arg Ser 


Val 
60 


Gin 


Glu 


Leu 


Pro 


Arg 
65 


Thr 


Cys 


Gly Arg 


Pro 
70 


Cys 


Gly Lys Leu 


Val 
75 


His 


Ser 


Glu 


His 


Ser 


Arg 


Asp 


Thr Met 


Gly 


Gin 


Ser Lys Ser 


Lys 










80 






85 






90 


His 


Ser 


Ala 


Tyr 


Leu 


His 


Phe 


He Lys 


Leu 


Leu 


Leu Lys Arg 


Ala 










95 






100 






105 


Gly 


He 


Lys 


Ala 


Ser 
110 


Thr 


Glu 


Asn Leu 


He 
115 


Thr 


Leu Phe Pro 


Thr 
120 



224/228 
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Val 


Glu 


Gin Tyr 


cys 


Pro 


Trp 


Phe 


Pro 


Glu 


His 


Gly Thr Met Asp 








125 










130 




135 


Phe 


Lys 


Asp Trp 


Glu 


Gin 


Val 


Gly 


He 


Ala 


Leu 


Lys Gin Val Cys 








140 










145 




150 


Lys 


Glu 


Gly Lys 


Phe 


He 


Pro 


Leu 


Thr 


Ala 


Trp 


Ser Asn Trp Ala 








155 










160 




165 


He 


Val 


Lys Ala 


Ala 


Ser 


Glu 


Pro 


Phe 


Gin 


Ser 


Glu Asn Glu Ala 






170 










175 




180 


Tyr 


Pro 


Pro Ala 


Glu 


Arg 


He 


Ser 


Ala 


Glu 


Glu 


Gly Gly Asp Ala 






185 










190 




195 


Ala 


Glu 


Gly Gly 


Glu 


Asp 


Ser 


Glu 


Glu 


Asp 


Phe 


Glu Glu Asn Thr 






200 










205 




210 


Asp 


Lys 


Pro Gly 


Asp 


Glu 


Leu 


He 


Ser 


Phe 


Glu 


Glu His Val Gly 


215 










220 




225 


Pro 


Ser 


Ala Ala 


Pro 


Lys 


He 


Glu 


Lys 


Pro 


Tyr 


Met Pro Arg Cys 








230 






235 




240 


Leu 


Lys 


Gin Arg 


Arg 


Ala 


Leu 


Arg 


Ser 


Ser 


Arg 


Leu Leu He Gly 






245 










250 




255 


He 


He 


Arg Ser 


Gly 


Arg 


Leu 


Gin 











260 

<210> 417 
<211> 175 
<212> PRT 
<2l3> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No; LG:480328. l.orf 1:2000FEB18 
<400> 417 



Val 


Arg 


Cys Gly 


Gly 


Gin Ser Gly 


Ser Arg 


Cys 


Glu Arg Val Pro 


1 






5 




10 




15 


Gin 


Lys 


Lys Thr 


Trp 


Leu Pro Lys 


Cys Leu 


Ser 


Pro Ser Ala Ser 








20 




25 




30 


Leu 


Gly 


Leu Ala 


Leu 


Ala Val Ala 


Gly Gly 


Met 


Val Asn Ser Ala 






35 




40 




45 


Leu 


Cys 


Asn Val 


Asp 


Ala Gly His 


Arg Ala 


Ala 


He Phe Asp Gin 






50 


55 




60 


Phe 


Arg 


Gly Val 


Gin 


Asn He Val 


Val Gly 


Glu 


Gly Thr His Phe 




65 




70 




75 


Leu 


He 


Pro Cys 


Val 


Gin Lys Pro 


He He 


Phe 


Asp Cys Cys Ser 






80 




85 




90 


Gin 


Pro 


Arg Ser 


Ala 


Pro Val He 


Thr Gly 


Ser 


Lys Asp Leu Gin 






95 




100 




105 


Asn 


Val 


Asn He 


Thr 


Leu Cys He 


Leu Phe 


Arg 


Pro He Thr Ser 








110 


115 




120 


Gin 


Leu 


Pro Arg 


He 


Phe Thr Ser 


He Gly 


GlU 


Asp Tyr Asp Glu 








125 




130 




135 


Cys 


Val 


Leu Pro 


Phe 


He Thr Thr 


Glu He 


Leu 


Lys Ser Leu Val 






140 




145 




150 


Ala 


Arg 


Phe Asp 


Ala 


Gly Glu Leu 


lie Thr 


Gin 


Arg Glu Leu Val 








155 


160 




165 


Ser 


Ser 


Gin Val 


Ser 


Asn Asn Leu 


Met Glu 












170 




175 







<210> 418 
<211> 272 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:311197 . l.orf 2 :2000MAY19 
<400> 418 

Gin Val Pro Phe Pro Ser His He Pro He Tyr Val Cys Leu Cys 
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1 






5 




10 




15 


Val 


Gin 


Gin Thr 


Pro 


Ala 


Phe Ala Thr Met 


Leu 


Ser Ser Thr Asp 








20 




25 




30 


Phe 


Thr 


Phe Ala 


Ser 


Tip 


Glu Leu Val Val 


Arg 


Val Asp His Pro 








35 


40 




45 


Asn 


Glu 


Glu Gin 


Gin 


Lys 


Asp Val Thr Leu 


Arg 


Val Ser Gly Asp 








50 


55 




60 


Leu 


His 


Val Gly 


Gly 


Val 


Met Leu Lys Leu 


Val 


Glu Gin He Asn 






65 




70 




75 


He 


Ser 


Gin Asp 


Trp 


Ser 


Asp Phe Ala Leu 


Trp 


Trp Glu Gin Lys 








80 




85 




90 


His 


Cys 


Trp Leu 


Leu 


Lys 


Thr His Trp Thr 


Leu 


Asp Lys Tyr Gly 




95 




100 




105 


Val 


Gin 


Ala Asp 


Ala 


Lys 


Leu Leu Phe Thr 


Pro 


Gin His Lys Met 






110 


115 




120 


Leu 


Arg 


Leu Arg 


Leu 


Pro 


Asn Leu Lys Met 


Val 


Arg Leu Arg Val 




125 




130 




135 


Ser 


Phe 


Ser Ala 


Val 


Val 


Phe Lys Ala Val 


Ser 


Asp He Cys Lys 








140 




145 




150 


He 


Leu 


Asn He 


Arg 


Arg 


Ser Glu Glu Leu 


Ser 


Leu Leu Lys Pro 








155 


160 




165 


Ser 


Gly 


Asp Tyr 


Phe 


Lys 


Lys Lys Lys Lys 


Lys 


Asp Lys Asn Asn 








170 




175 




180 


Lys 


Glu 


Pro He 


He 


Glu 


Asp He Leu Asn 


Leu 


Glu Ser Ser Pro 






185 




190 




195 


Thr 


Ala 


Ser Gly 


Ser 


Ser 


Val Ser Pro Gly 


Leu 


Tyr Ser Lys Thr 






200 




205 




210 


Met 


Thr 


Pro He 


Tyr 


Asp 


Pro lie Asn Gly 


Thr 


Pro Ala Ser Ser 








215 


220 




225 


Thr 


Met 


Thr Trp 


Phe 


Ser 


Asp Ser Pro Leu 


Thr 


Glu Gin Asn Cys 






23 0 




235 




240 


Ser 


He 


Leu Ala 


Phe 


Ser 


Gin Pro Pro Gin 


Ser 


Pro Glu Ala Leu 








245 




250 




255 


Ala 


Asp 


Met Tyr 


Gin 


Pro 


Arg Ser Leu Val 


Asp 


Thr Ala Lys Leu 




260 




265 




270 


Asn 


Ala 















<210> 419 
<211> 167 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1054883 .l.orf 1 :2000MAY19 
<220> 

<221> unsure 
<222> 2 

<223> unknown or other 
<400> 419 



Arg 


Xaa 


Arg 


Gin 


Pro 


Leu 


Leu 


Gin 


Ser 


His 


Pro 


Glu Ala Asp Trp 


1 








5 










10 




15 


Ser 


Thr 


His 


Gly 


Arg 


Ser 


Met 


Arg 


Lys 


Leu 


He 


Val Arg Phe He 








20 










25 




30 


Phe 


Leu 


Lys 


Phe 


Trp 


Thr 


Tyr 


Thr 


Val 


Arg 


Ala 


Ser Thr Asn Leu 








35 








40 




45 


Thr 


Gin 


Asn 


Gly 


Asp 


Cys 


Ser 


Gin 


Cys 


He 


Tyr 


Gin Val Thr Glu 










50 








55 




60 


Val 


Gly 


Gin 


Gin 


He 


Lys 


Thr 


He 


Phe 


Leu 


Phe 


Tyr Ser Tyr Tyr 








65 








70 




75 


Glu 


cys 


Met 


Glu 


Thr 


Leu 


Lys 


Glu 


Thr 


Cys 


Leu 


Tyr Asn Ala Thr 








80 








85 




90 


Gin 


Tyr 


Lys 


Val 


Cys 
95 


Ser 


Pro 


Arg 


Asn 


Asp 
100 


Arg 


Pro Asp Ala Cys 
105 
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Tyr Asn Pro Ser Glu Pro Ala Ala Thr Thr Val Phe Glu He Arg 

110 115 120 

Thr Gly Leu Leu Leu Gly Asp Thr Ser Lys He He Thr Arg Thr 

125 130 135 

Glu Glu Lys Glu He Pro Lys Gin He Thr Leu Arg Phe Asp Ala 

140 145 150 

Cys Ala Ala He Asn Ser Lys Lys Leu Glu He Gly Cys Gly Ser 

155 160 165 

Leu Asn 



<210> 420 

<211> 59 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:399395 . 1 .orf 2 : 2000MAY19 
<400> 420 

Ser Gin His Phe Gly Arg Pro Arg Gin Glu Asp His Leu Ser Pro 
15 10 15 

Gly Val Gin Asp Gin Pro Gly Gin His Ser Glu Thr Leu Thr Gin 

20 25 . 30 

Lys He Lys Arg Lys Asp Lys Asn Thr Arg Met Ala Lys Gin Thr 

35 40 45 

Ser Val His Gin Pro Gly Gly He Leu Tyr Ser Leu Leu Lys 

50 55 



<210> 421 
<211> 216 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:380497 .2 .orf 1:2000MAY19 
<400> 421 



Ser 


Pro 


Arg 


Pro 


Leu 


Gin 


Ser 


Ala Gly 


Glu 


Gly 


Val Thr His Val 


1 






5 






10 




15 


Leu 


He 


Leu 


Leu 


Glu 


Ser 


Pro 


Ala Arg 


Pro 


Val 


Ala Ala Val Thr 










20 






25 




30 


Gin 


Val 


Gin 


Arg 


Arg 


Arg 


Tyr 


His Arg 


Leu 


Ser 


Asp Met Ser Met 










35 








40 




45 


Leu 


Ala 


Glu 


Arg 


Arg 


Arg 


Lys 


Gin Lys 


Trp 


Ala 


Val Asp Pro Gin 










50 








55 




60 


Asn 


Thr 


Ala 


Trp 


Ser 


Asn 


Asp 


Asp Ser 


Lys 


Phe 


Gly Gin Arg Met 










65 








70 




75 


Leu 


Glu 


Lys 


Met 


Gly 


Trp 


Ser 


Lys Gly 


Lys 


Gly 


Leu Gly Ala Gin 










80 








85 




90 


Glu 


Gin 


Gly 


Ala 


Thr 


Asp 


His 


He Lys 


Val 


Gin 


Val Lys Asn Asn 








95 








100 




105 


His 


Leu 


Gly 


Leu 


Gly 


Ala 


Thr 


He Asn 


Asn 


Glu 


Asp Asn Trp He 








110 








115 




120 


Ala 


His 


Gin 


Asp 


Asp 


Phe 


Asn 


Gin Leu 


Leu 


Ala 


Glu Leu Asn Thr 










125 








130 




135 


Cys 


His 


Gly 


Gin 


Glu 


Thr 


Thr 


Asp Ser 


Ser 


Asp 


Lys Lys Glu Lys 










140 








145 




150 


Lys 


Ser 


Phe 


Ser 


Leu 


Glu 


Glu 


Lys Ser 


Lys 


He 


Ser Lys Asn Arg 










155 








160 




165 


Val 


His 


Tyr 


Met 


Lys 


Phe 


Thr 


Lys Gly 


Arg 


cys 


Gin Ser Leu His 










170 








175 




180 


Ser 


Arg 


Gly 


Glu 


Arg 


Asn 


His 


Asp Asn 


Gin 


Arg 


Leu His His Pro 










185 








190 




195 


Gly 


Val 


Leu 


Cys 


Gin 


Ala 


Asp 


Gly Ser 


Thr 


Glu 


Glu Gin Ala Pro 
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200 205 210 

Gly Ser Ser Ser Arg Val 
215 

<210> 422 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 272913 .22 . orf 1:20 00MAY01 
<400> 422 



Ser 


Ala 


Thr 


Ala 


Ala Ala 


Pro Arg Glu 


Arg 


Lys 


Met Ala Pro 


His 


1 








5 




10 






15 


Gly 


Pro 


Gly 


Ser 


Leu Thr 


Thr Leu Val 


Pro 


Trp 


Ala Ala Ala 


Leu 










20 




25 






30 


Leu 


Leu 


Ala 


Leu 


Gly Val 


Glu Arg Ala 


Leu 


Ala 


Leu Pro Glu 


He 










35 




40 






45 


Cys 


Thr 


Gin 


Cys 


Pro Gly 


Ser Val Gin 


Asn 


Leu 


Ser Lys Val 


Ala 






50 




55 






60 


Phe 


Tyr 


Cys 


Lys 


Thr Thr 


Arg Glu Leu 


Met 


Leu 


His Ala Arg 


Cys 










65 




70 






75 


Cys 


Leu 


Asn 


Gin 


Lys Asp 


Pro Gly Pro 


Asn 


Phe 


His Gin Ala 


His 










80 


85 






90 


Thr 


Thr 


Val 


He 


He Asp 


Leu Gin Ala 


Asn 


Pro 


Leu Lys Gly 


Asp 










95 




100 






105 


Leu 


Ala 


Asn 


Thr 


Phe Arg 


Gly Phe Thr 


Gin 


Leu 


Gin Thr Leu 


He 










110 




115 






120 


Leu 


Pro 


Gin 


His 


Val Asn 


Cys Ser Gly 


Gly 


He 


Asn Ala Tip 


Asn 










125 




130 






135 


Thr 


He 


Thr 


Ser 


Tyr He 


Asp Asn Gin 


He 


Cys 


Gin Gly Gin 


Lys 










140 




145 






150 


Asn 


Leu 


cys 


Asn 


Asn Thr 


Gly Asp Pro 


Glu 


Met 


Cys 





155 160 
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